09786635 Results 



SEQ ID NO: 1 

SUMMARIES 

% 

Result Query 



Jo. 


Score 


Match 


Length DB 


ID 


Description 


1 


6880 


100, 


0 


6880 


6 


AX2 5 34 5 0 


AX253450 


Sequence 


2 


6880 


100 . 


, 0 


6880 


9 


HSA012376 


AJ012376 


Homo sapi 


3 


6880 


100 . 


.0 


7260 


6 


AX25 34 52 


AX253452 


Sequence 


4 


6879 


100 . 


. 0 


9497 


6 


AX059973 


AX059973 


Sequence 


5 


6879 


100 . 


. 0 


9497 


9 


AF165281 


AF165281 


Homo sapi 


6 


6875 . 8 


99 . 


. 9 


9741 


6 


AX127830 


AX127830 


Sequence 


7 


6875 . 8 


99 . 


. 9 


9741 


6 


AX139817 


AX139817 


Sequence 


8 


6875 .8 


99 - 


. 9 


9741 


6 


AX351038 


AX351038 


Sequence 


9 


6875 .8 


99 . 


. 9 


9854 


6 


AX12 7831 


AX127831 


Sequence 


10 


6875 .8 


99 , 


. 9 


9854 


6 


AX139818 


AX139818 


Sequence 


11 


6870.4 


99 . 


. 9 


7860 


6 


AX092594 


AX092594 


Sequence 


12 


6865 


99. 


.8 


9495 


6 


AX059978 


AX059978 


Sequence 


13 


6862 . 4 


99 . 


.7 


10442 


6 


AX060713 


AX060713 


Sequence 


14 


6862 .4 


99 . 


.7 


10442 


6 


AX060892 


AX060892 


Sequence 


15 


6862 .4 


99 . 


.7 


10442 


9 


AF285167 


AF285167 


Homo sapi 


16 


6860.8 


99 . 


.7 


10474 


6 


AX060719 


AX060719 


Sequence 


17 


6860.8 


99 . 


.7 


10474 


6 


AX060721 


AX060721 


Sequence 


18 


6860.8 


99 . 


,1 


10474 


6 


AX060898 


AX060898 


Sequence 


19 


6860.8 


99 . 


.7 


10474 


6 


AX060900 


AX060900 


Sequence 


20 


6845 .2 


99. 


.5 


7862 


6 


AX135712 


AX135712 


Sequence 


21 


6760.2 


98. 


.3 


9593 


6 


AX059976 


AX059976 


Sequence 


22 


6711.6 


97. 


.6 


6786 


9 


AB055982 


AB055982 


Homo sapi 


23 


5630 


81. 


.8 


7878 


10 


MMABCl 


X7 592 6 Mus musculu 


24 


5455.6 


79. 


.3 


6801 


10 


AY2 08182 


Ay2 08182 Rattus no 


25 


4302 


62 . 


.5 


7074 


5 


AF362377 


AF362377 


Gallus ga 


26 


2052 .4 


29. 


.8 


5097 


6 


BD012346 


BD012346 


Genes rel 


27 


2052 .4 


29. 


.8 


5097 


6 


BD160225 


BD160225 


Primer fo 


28 


2052 .4 


29. 


.8 


5097 


9 


AK027864 


AK027864 


Homo sapi 


29 


1911.6 


27. 


.8 


7298 


9 


AF001945 


AF001945 


Homo sapi 



AF165281 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

FEATURES 

source 



AF165281 9497 bp mRNA linear PRI 17-AUG-1999 

Homo sapiens ATP cassette binding transporter 1 (ABCl) raRNA, 
complete cds . 
AF165281 

AF165281.1 GI:5734100 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa,- Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 9497) 

Rust,S., Rosier, M., Funke,H., Real, J., Amoura,Z., Piette^J.C, 

Deleuze, J.F. , Brewer, H.B., Duverger,N., Denefle,?. and Assmann,G. 

Tangier disease is caused by mutations in the gene encoding 

ATP-binding cassette transporter 1 

Nat. Genet. 22 (4), 352-355 (1999) 

99364413 

10431238 

2 (bases 1 to 9497) 

Rust,S., Rosier, M., Funke,H., Real, J., Amoura,Z., 
Deleuze, J.F. , Brewer, H.B., Duverger,N., Denefle,P 
Direct Submission 

Submitted ( 01- JUL-1999) Genomics, Rhone-Poulenc Rorer 
Cr#mieux, Bvry 91006, France 

Location/Qualifiers 

1. .9497 

/organisms "Homo sapiens" 
/mol_type= "mRNA " 



Piette, J.C. , 
and Assmann,G. 

2 rue Gaston 



/db_xref="taxon: 9606" 

/ chromosome^ " 9 " 

/map^»9q31" 
gene 1. .94 97 

/gene="ABCl" 
CDS 121. .6726 

/gene="ABCl" 

/note="ABC transporter; ABCl protein" 
/codon_start=l 

/products "ATP cassette binding transporter 1" 
/protein_id= " AAD4 9849.1" 
/db_xref="GI : 5734101" 

/translation="MPSAGTLPWVQGIICNANNPCFRyPTPGEAPGWGNFNKSIVAR 
LFSDARRLLLYSQKDTSMKDMRKVLRTLQQ I KKSSSNLKLQDFLVDNETFSGFLYHNL 
SLPKSTVDKMLRADVILHKVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPRE 
KLAAAERVLRSN^roILKPILRTLNSTSPFPSKELAEATKTLLHSLGTLAQELFSMRSW 
SDMRQE VMFLTNVNS S S SSTQ I YQAVSRI VCGHPEGGGLK I KSLNWYEDNNYKALFGG 
NGTEEDAETFYDNSTTPYCNDLMKNLESSPLSRIIWKALKPLLVGKILYTPDTPATRQ 
VMAEVNKTFQELAVFHDLEGMWE ELS P K I WTFMENSQEMDLVRMLLDSRDNDHFWEQQ 
LDGLDWTAQDI VAFLAKHPEDVQSSNGSVYTWREAFNETNQA I RT I SRFMECVNLNKL 
EPIATEVWLINKSMELLDERKFWAGIVFTGITPGSIELPHHVKYKIRMDIDNVERTNK 
IKDGYWDPGPRADPFEDMRYVWGGFAYLQDWEQA I IRVLTGTEKKTGVYMQQMPYPC 
YVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKGIVYEKEARLKETMRIMGLDNSILWFS 
WFISSLIPLLVSAGLLWILKLGNLLPYSDPSWFVFLSVFAWTILQCFLISTLFSR 
ANLAAACGGIIYFTLYLPYVLCVAWQDYVGFTLKIFASLLSPVAFGFGCEYFALFEEQ 
GIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFIiYGVMTWYI EAVFPGQYGI PRPWYF 
PCTKS YWFGEESDEKSHPGSNQKRI SE I CMEEEPTHLKLGVS IQNLVKVYRDGMKVAV 
DGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYILGKDIRSEMSTIRQN 
LGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVGLPSSKLKSKTSQ 
LSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQGRTIILSTHH 
MDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLSSCRNSSST 
VSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDIGHELTY 
VLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAETSDG 
TLP ARRNRRA FGDKQS CLRP FTEDDAADPNDSD I DPESRETDLLSGMDGKGS YQVKGW 
KLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ 
PWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPDTPCQAGEEEWTT 
APVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADI 
LQDLTGRNI SDYLVKTYVQ II A KSLKNKI WVNE FRYGGFS LGVSNTQ ALP PSQE VNDA 
TKQMKKHLKLAKDS S ADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHA I S S FLNV INNA 
ILRANLQKGENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVS ICVI FAMSFVPA 
SFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQKS 
YVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATF 
VLELFTDNKLNNINDI LKS VFL I FPHFCLGRGL IDMVKNQAMADALERFGENRFVS PL 
SWDLVGRNLFAMAVEGWFFHTVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRI 
LDGGGQNDILEIKELTKIYRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLT 
GDTTVTRGDAFLNRNS I LSN I HE VHQNNGYCPQ FDA I TELLTGREHVE FFALLRGVPE 
KEVGKVGEWAIRKLGLVKYGEKYAGNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMD 
PKARRFLWNCALSWKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNR 
FGDGYTIWRIAGSNPDLKPVQDFFGLAFPGSVPKEKHRNMLQYQLPSSLSSLARIFS 
I LS QS KKRLH lEDYS VSQTTLDQVFVNFA KDQSDDDHLKDLSLHKNQTWDVA VLTSF 
LQDEKVKESYV" 

BASE COUNT 2600 a 2115 c 2217 g 2564 t 1 others 

ORIGIN 



Query Match 100.0%; Score 6879; DB 9; Length 94 97; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 687 9; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy 


1 


CAAACATGTCAGCTGTTACTGGAAGTGGCCTGGCCTCTATTTATCTTCCTGATCCTGATC 


60 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 




Db 


1 


CAAACATGTCAGCTGTTACTGGAAGTGGCCTGGCCTCTATTTATCTTCCTGATCCTGATC 


60 


Qy 


61 


TCTGTTCGGCTGAGCTACCCACCCTATGAACAACAT-GAATGCCATrTTCCAAATAAAGCC 


12 0 




IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


61 


TCTGTTCGGCTGAGCTACCCACCCTATGAA CA A CATGAATGCCATTTTCCAAATAAAGCC 


12 0 


Qy 


121 


ATGCCCTCTGCAGGAA CA CTTCCTTGGGTTCAGGGGATTATCTGTAATGCCAACAACC CO 


180 




IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


121 


ATGCCCTCTGCAGGAA CA CTTCCTTGGGTTCAGGGGATTATCTGTAATGCCAACAACCCC 


180 


Qy 


181 


TGTTTCCGTTACCCGACTCCTGGGGAGGCTCCCGGAGTTGTTGGAAACTTTAACAAATCC 


240 



Illllllllllllllllllllllllllllilllllllllillllllllllllllllllll 



Db 


181 


TGTTTC CGTTA CC CGA CTC CTGGGGAGGCTC C CGGAGTTGTTGGAAA CTTT AA C AAATCC 


zh\j 


Qy 


241 


ATTGTGGCTCGCCTGTTCTCAGATGCTCGGAGGCrrTCTTTTATACAGCCAGAAAGACACC 


300 




llllllllllllllilllllllllllllllllllllllllllllllllllllllllllll 




Db 


241 


ATTGTGGCTCGCCTCTTCTC AGATGCTCGGAGGCTTCTTTTATA C A GCC AGA AAGA C A C C 


300 


Qy 


301 


AGCATGAAGGACATGCGCAAAGTTCTGAGAACATTACAGCAGATCAAGAAATCCAGCTCA 


360 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 




Db 


301 


AGC ATGAAGGA C ATGCGC AAAGTTCTGA GAA C ATTA C AGC AGATC AAGAAATCC A GCTC A 


36 0 


Qy 


361 


AACTTGAAGCTTCAAGATTTCCTGGTGGACAATGAAACCTTCTCTGGGTTCCTGTATCAC 


420 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


361 


AACTTGAAGCTTCAAGATTTCCTGGTGGACAATGAAACCTTCTCTGGGTTCCTGTATCAC 


42 0 


Qy 


421 


AACCTCTCTCTCCCAAAGTCTACTGTGGACAAGATGCTGAGGGCTGATGTCATTCTCCAC 


480 




IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


421 


AACCTCTCTCTCCCAAAGTCTACTGTGGACAAGATGCTGAGGGCTGATGTCATTCTCCAC 


48 0 


Qy 


481 


AAGGTATTTTTGCAAGGCTACCAGTTACATTTGACAAGTCTGTGCAATGGATCAAAATCA 


540 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


481 


AA GGT ATTTTTGC A AGGCTA C C A GTTA C A TTTGA C A AGTCTGTGC A ATGGATC A A AATC A 


54 0 


Qy 


541 


GAAGAGATGATTCAACTTGGTGACCAAGAAGTTTCTGAGCTTTGTGGCCTACCAAGGGAG 


600 




IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 




Db 


541 


GAAGAGATGATTC AA CTTGGTGA C CAAGA AGTTTCTGAGCTTTGTGGC CTAC C AAGGGAG 


600 


Qy 


601 


AAACTGGCTGCAGCAGAGCGAGTACTTCG1TCCAACATGGACATCCTGAAGCCAATCCTG 


660 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 




Db 


601 


AA A CTGGCTGC AGC AGAGCGA GTA CTTCGTTCC AA C ATGGA C AT C CTGA AGC C A A TC CTG 


660 


Qy 


661 


AGAACACTAAACTCTACATCTCCCTTCCCGAGCAAGGAGCTGGCCGAAGCCACAAAAACA 


720 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


661 


AGA A C A CTAA A CTCTA C ATCTC C CTTC C CGAGC A AGGAGCTGGC CGA A GC OA C A A AA A C A 


72 0 


Qy 


721 


TTGCTGCATAGTCTTGGGACTCTGGCCCAGGAGCTGTTCAGCATGAGAAGCTGGAGTGAC 


780 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


721 


TTGCTGCATAGTCTTGGGACTCTGGCCCAGGAGCTGTTCAGCATGAGAAGCTGGAGTGAC 


780 


Qy 


781 


ATGCGACAGGAGGTGATGTTTCTGACCAATGTGAACAGCTCCAGCTCCTCCACCCAAATC 


840 




IIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


781 


ATGCGACAGGAGGTGATGTTTCTGACCAATGTGAACAGCTCCAGCTCCTCCACCCAAATC 


840 


Qy 


841 


TACCAGGCTGTGTCTCGTATTGTCTGCGGGCATCCCGAGGGAGGGGGGCTGAAGATCAAG 


900 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 




Db 


841 


TACCAGGCTGTGTCTCGTATTGTCTGCGGGCATCCCGAGGGAGGGGGGCTGAAGATCAAG 


900 


Qy 


901 


TCTCTCAACTGGTATGAGGA CAACAA CTACAAAGCCCTCTTTGGAGGCA ATGGCACTGAG 


960 






Mllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


901 


TCTCTC A A CTGGTATGAGGA C A A C AA CTA C AAA GC C CTCTTTGGA GGC A ATGGC A CTGA G 


960 


Qy 


961 


GAAGATGCTGAAACCTTCTATGACAACTCTACAACTCCTTACTGCAATGATTTGATGAAG 


1020 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


961 


GAAGATGCTGAAACCTTCTATGACAA CTCTA CAA CTCCTTACTGCAATGATTTGATGAAG 


1020 


Qy 


1021 


AATTTGGAGTCTAGTCCTCTTTCCCGCATTATCTGGAAAGCTCTGAAGCCGCTGCTCGTT 


1080 




IIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


1021 


AATTTGGAGTCTAGTCCTCITTCCCGCATTATCTGGAAAGCTCTGAAGCCGCTGCTCGTT 


1080 


Qy 


1081 


GGGAAGATCCTGTATACACCTGACACTCCAGCCACAAGGCAGGTCATGGCTGAGGTGAAC 


1140 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 > 1 1 1 < 




Db 


1081 


GGGAAGATCCTGTATACACCTGACACTCCAGCCACAAGGCAGGTCATGGCTGAGGTGAAC 


1140 


Qy 


1141 


AAGACCTTCCAGGAACTGGCTGTGTTCCATGATCTGGAAGGCATGTGGGAGGAACTCAGC 


1200 




IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 




Db 


1141 


AAGACCTTCCAGGAACTGGCTGTGTTCCATGATCTGGAAGGCATGTGGGAGGAACTCAGC 


1200 


Qy 


1201 


CCCAAGATCTGGACCTTCATGGAGAACAGCCAAGAAATGGACCTTGTCCGGATGCTGTTG 


1260 






llllllllllllllllllllllllllllllllllllillllllllillllllMIIIIII 




Db 


1201 


CCCAAGATCTGGACCTTCATGGAGAACAGCCAAGAAATGGACCTTGTCCGGATGCTGTTG 


1260 



Qy 


1261 


GACAGCAGGGACAATGACCACTTTTGGGAACAGCAGTTGGATGGCTTAGATTGGACAGCC 


1320 




llllllilllMllllllllllllllllllillllllllllllllllillllllllllil 




Db 


1261 


GACAGCAGGGACAATGACCACTTTTGGGAACAGCAGrrGGATGGCTTAGATTGGACAGCC 


1320 


Qy 


1321 


CAAGACATCGTGGCGTTTTTGGCCAAGCACCCAGAGGATGTCCAGTCCAGTAATGGTTCT 


1380 




IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIillllllllllllllllllllllll 




Db 


1321 


CAAGACATCGTGGCGTTTTTGGCCAAGCACCCAGAGGATGTCCAGTCCAGTAATGGTTCT 


1380 


Qy 


1381 


GTGTACA CCTGGAGAGAAGCTTTCAACGAGACTAACC AGGCAATCCGGA CCATATCTCGC 


1440 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIII 




Db 


1381 


GTGTACA CCTGGAGAGAAGCTTTCAA CGAGA CTAACCAGGCAATCCGGACCATATCTCGC 


1440 


Qy 


1441 


TTCATGGAGTGTGTCAACCTGAACAAGCTAGAACCCATAGCAACAGAAGTCTGGCTCATC 


1500 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


1441 


TTCATGGAGTGTGTCAACCTGAACAAGCTAGAACCCATAGCAACAGAAGTCTGGCTCATC 


1500 


Qy 


1501 


AACAAGTCCATGGAGCTGCTGGATGAGAGGAAGTTCTGGGCTGGTATTGTGTTCACTGGA 


1560 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


1501 


AACAAGTCCATGGAGCTGCTGGATGAGAGGAAGTTCTGGGCTGGTATTGTGTTCACTGGA 


1560 


Qy 


1561 


ATTACTCCAGGCAGCATTGAGCTGCCCCATCATGTCAAGTACAAGATCCGAATGGACATT 


1620 




IIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


1561 


ATTACTCCAGGCAGCATTGAGCTGCCCCATCATGTCAAGTACAAGATCCGAATGGACATT 


1620 


Qy 


1621 


GACAATGTGGAGAGGACAAATAAAATCAAGGATGGGTACTGGGACCCTGGTCCTCGAGCT 


1680 




IIIIIIIIMIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 




Db 


1621 


GACAATGTGGAGAGGACAAATAAAATCAAGGATGGGTACTGGGACCCTGGTCCTCGAGCT 


1680 


Qy 


1681 


GACCCCTTTGAGGACATGCGGTACGTCTGGGGGGGCTTCGCCTACTTGCAGGATGTGGTG 


1740 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 




Db 


1681 


GACCCCTTTGAGGACATGCGGTACGTCTGGGGGGGCTTCGCCTACTTGCAGGATGTGGTG 


1740 


Qy 


1741 


GAGCAGGCAATCATCAGGGTGCTGACGGGCACCGAGAAGAAAACTGGTGTCTATATGCAA 


1800 






IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 




Db 


1741 


GAGCAGGCAATCATCAGGGTGCTGACGGGCACCGAGAAGAAAACTGGTGTCTATATGCAA 


1800 


Qy 


1801 


CAGATGCCCTATCCCTGTTACGTTGATGACATCTTTCTGCGGGTGATGAGCCGGTCAATG 


1860 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


1801 


CAGATGCCCTATCCCTGTTACGTTGATGACATCTTTCTGCGGGTGATGAGCCGGTCAATG 


1860 


Qy 


1861 


CCCCTCTTCATGACGCTGGCCTGGATTTACTCAGTGGCTGTGATCATCAAGGGCATCGTG 


1920 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


1861 


CCCCTCTTCATGACGCTGGCCTGGATTTACTCAGTGGCTGTGATCATCAAGGGCATCGTG 


1920 


Qy 


1921 


TATGAGAAGGAGGCACGGCTGAAAGAGACCATGCGGATCATGGGCCTGGACAACAGCATC 


1980 




IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


1921 


TATGAGAAGGAGGCACGGCTGAAAGAGACCATGCGGATCATGGGCCTGGACAACAGCATC 


1980 


Qy 


1981 


CTCTGGTTTAGCTGGTTCATTAGTAGCCTCATTCCTCTTCTTGTGAGCGCTGGCCTGCTA 


2040 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIII 




Db 


1981 


CTCTGGTTTAGCTGGTTCATTAGTAGCCTCATTCCTCTTCTTGTGAGCGCTGGCCTGCTA 


2040 


Qy 


2041 


GTGGTCATCCTGAAGTTAGGAAACCTGCTGCCCTACAGTGATCCCAGCGTGGTGTTTGTC 


2100 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


2041 


GTGGTCATCCTGAAGTTAGGAAACCTGCTGCCCTACAGTGATCCCAGCGTGGTGTTTGTC 


2100 


Qy 


2101 


TTCCTGTCCGTGTTTGCTGTGGTGACAATCCTGCAGTGCTTCCTGATTAGCACACTCTTC 


2160 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


2101 


TTCCTGTCCGTGTTTGCTGTGGTGACAATCCTGCAGTGCTTCCTGATTAGCACACTCTTC 


2160 


Qy 


2161 


TCCAGAGCCAACCTGGCAGCAGCCTGTGGGGGCATCATCTACTTCACGCTGTACCTGCCC 


2220 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


2161 


TCCAGAGCCAA CCTGGCAGCAGCCTGTGGGGGCATCATCTACTTCA CGCTGTACCTGCCC 


2220 


Qy 


2221 


TACGTCCTGTGTGTGGCATGGCAGGA CTA CGTGGGCTTCACACTCAAGATCTTCGCTAGC 


2280 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


2221 


TACGTCCTGTGTGTGGCATGGCAGGA CTA CGTGGGCTTCA CACTCAAGATCTTCGCTAGC 


2280 


Qy 


2281 


CTGCTGTCTCCTGTGGCTTTTGGGTTTGGCTGTGAGTACTTTGCCCTTTTTGAGGAGCAG 


2340 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


2281 


CTGCTGTCTCCTGTGGCTTTTGGGTTTGGCTGTGAGTACTTTGCCCTTTTTGAGGAGCAG 


2340 



Qy 


2341 


GGCATTGGAGTGCAGTGGGACAACCTOTTTGAGAGTCCTGTGGAGGAAGATGGCTTCAAT 


2400 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIilllMI 




Db 


2341 


GGCATTGGAGTGCAGTGGGACAACCTGn. X^tACjAU 1 i%jl\jtxAijOAiiUA itjIjC i x k-aa i 


2 4 00 


Qy 


2401 


CTCACCACTTCGGTCTCCATGATGCTGTTTGACACCTTCCTCTATGGGGTGATGACCTGG 


2460 




IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 




Db 


2401 


CTCA C CA CTTTCGGTCTC C ATGATGCTG 1 1 l\iA C A C C 1 1 CCl Ci A l\3<iiM:ri\jA x\7A^l. itjVj 


2460 


Qy 


2461 


TACATTGAGGCTGTCTTTCCAGGCCAGTACGGAATTCCCAGGCCCTGGTATTTTCCTTGC 


2520 




IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


2461 


TACATTGAGGCTGTC i i 1 CCAGGCCAGTACGGAA L ll_H_A(j<ji-t_t, IajOI Ai i i 


2520 


Qy 


2521 


ACCAAGTCCTACTGGTTTGGCGAGGAAAGTGATGAGAAGAGCCACCCTGGTTCCAACCAG 


2580 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIII 




Db 


2521 


ACCAAGTCCTACTGGTTTGGCGAGGAAAGTGATGAGAAGAGCCACCCTGGTTCCAACCAG 


•5 C ft A 


Qy 


2581 


AAGAGAATATCAGAAATCTGCATGGAGGAGGAACCCACCCACTTGAAGCTGGGCGTGTCC 


2640 




IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIII 




Db 


2581 


A AGA GAATATC AGA A ATCTGC ATGGAGGAGGA A CC C A C C C A CTTGAA GCTe<KJ ce 1 tj 


o c 4 n 
^ o 4 u 


Qy 


2641 


ATTCAGAACCTGGTAAAAGTCTACCGAGATGGGATGAAGGTGGCTGTCGATGGCCTGGCA 


2700 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIII 




Db 


2641 


ATTCAGAACCTGGTAAAAGTCTACCGAGATGGGATGAAGG 1 WjC i\3f i CCjA 1 ij<jt_i_ 1 uoCA 


•5 T n n 


Qy 


2701 


CTGAATTTTTATGAGGGCCAGATCACCTCCTTCCTGGGCCACAATGGAGCGGGGAAGACG 


2760 




IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


2701 


CTGAATTTTTATGAGGGCCAGATCACCTCCTTCCTGGGCCACAATGGAGCGGGGAAGACG 


276 0 


Qy 


2761 


ACCACCATGTCAATCCTGACCGGGTTGTTCCCCCCGACCTCGGGCACCGCCTACATCCTG 


2820 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


2761 


ACCACCATGTCAATCCTGACCGGGTTGTTCCCCCCGACCTCGGGCACCGCCTACATCCTG 


282 0 


Qy 


2821 


GGAAAAGACATTCGCTCTGAGATGAGCACCATCCGGCAGAACCTGGGGGTCTGTCCCCAG 


2880 




IIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


2821 


GGA AAAGA C ATTCGCTCTGAGATGAGC A CC ATC CGGC A GAA C CTGGGGGTCTGTCC C C AG 


2880 


Qy 


2881 


CATAACGTGCTGTTTGACATGCTGACTGTCGAAGAACACATCTGGTTCTATGCCCGCTTG 


2940 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


2881 


C ATA A CGTGCTGTTTGA C ATGCTGA CTGTCGA AGA A C A C ATCTGGTTCTATGC C CGCTTG 


2940 


Qy 


2941 


AAAGGGCTCTCTGAGAAGCACGTGAAGGCGGAGATGGAGCAGATGGCCCTGGATGTTGGT 


3000 




IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIMIIIII 




Db 


2941 


A A A GGGCTCTCTGAGAAGC A CGTGA AGGCGGAGATGGAGC A GATGGC CCTGGATGTTGGT 


3000 


Qy 


3001 


TTGCCATCAAGCAAGCTGAAAAGCAAAACAAGCCAGCTGTCAGGTGGAATGCAGAGAAAG 


3060 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIII 




Db 


3001 


TTGCCATCAAGCAAGCTGAAAAGCAAAACAAGCCAGCTGTCAGGTGGAATGCAGAGAAAG 


306 0 


Qy 


3061 


CTATCTGTGGCCTTGGCCTTTGTCGGGGGATCTAAGGTTGTCATTCTGGATGAACCCACA 


3120 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


3061 


CTATCTGTGGC CTTGGC CTTTGTCGGGGGATCT AAGGTTGTC ATTCTGGATGA A C C C A C A 


312 0 


Qy 


3121 


GCTGGTGTGGACCCTTACTCCCGCAGGGGAATATGGGAGCTGCTGCTGAAATACCGACAA 


3180 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMMIIIIII 




Db 


3121 


GCTGGTGTGGACCCTTACTCCCGCAGGGGAATATGGGAGCTGCTGCTGAAATACCGACAA 


3180 


Qy 


3181 


GGCCGCACCATTATTCTCTCTACACACCACATGGATGAAGCGGACGTCCTGGGGGACAGG 


3240 




IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 




Db 


3181 


GGCCGCA CCATTATTCTCTCTACA CACCAC ATGGATGAAGCGGA CGTCCTGGGGGA CAGG 


324 0 


Qy 


3241 


ATTGCCATCATCTCCCATGGGAAGCTGTGCTGTGTGGGCTCCTCCCTGTTTCTGAAGAAC 


3300 




ll!ll!!M!!!!!!!!!!!!!!!!!!!!!lllllll!llllll!lliliiiiiiliiii 




Db 


3241 


ATTGCCATCATCTCCCATGGGAAGCTGTGCTGTGTGGGCTCCTCCCTGTTTCTGAAGAAC 


3300 


Qy 


3301 


CAGCTGGGAACAGGCTACTACCTGACCTTGGTCAAGAAAGATGTGGAATCCTCCCTCAGT 


3360 




IIIIIIMIIIIIIIIIIIMIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIII 




Db 


3301 


CAGCTGGGAACAGGCTACTACCTGACCTTGGTCAAGAAAGATGTGGAATCCTCCCTCAGT 


3360 


Qy 


3361 


TCCTGCAGAAACAGTAGTAGCACTGTGTCATACCTGAAAAAGGAGGACAGTGTTTCTCAG 


3420 



lllllllllllllllllllllllllllllllllllllllllllllillllllllllllll 



Db 


3361 


TCCTGCAGAAACAGTAGTAGCACTGTGTCATACCTGAAAAAGGAGGACAGTGTTTCTCAG 


3420 


Qy 


3421 


AGCAGTTCTGATGCTGGCCTGGGCAGCGACCATGAGAGTGACACGCTGACCATCGATGTC 


3480 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIII 




Db 


3421 


AGCAGTTCTGATGCTGGCCTGGGCAGCGACCATGAGAGTGACACGCTGACCATCGATGTC 


3480 


Qy 


3481 


TCTGCTATCTCCAACCTCATCAGGAAGCATGTGTCTGAAGCCCGGCTGGTGGAAGACATA 


3540 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIII 




Db 


3481 


TCTGCTATCTCCAA CCTCATCAGGAAGCATGTGTCTGAAGCCCGGCTGGTGGAAGA CATA 


3540 


Qy 


3541 


GGGCATGAGCTGACCTATGTGCTGCCATATGAAGCTGCTAAGGAGGGAGCCTTTGTGGAA 


3600 




IMIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIMIIMIIIIIIIIMIIIIIIIII 




Db 


3541 


GGGCATGAGCTGACCTATGTGCTGCCATATGAAGCTGCTAAGGAGGGAGCCTTTGTGGAA 


3600 


Qy 


3601 


CTCTTTCATGAGATTGATGACCGGCTCTCAGACCTGGGCATTTCTAGTTATGGCATCTCA 


3660 




IIIMIMIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIMIIIIIII 




Db 


3601 


CTCTTTCATGAGATTGATGACCGGCTCTCAGACCTGGGCATTTCTAGTTATGGCATCTCA 


3660 


Qy 


3661 


GAGACGACCCTGGAAGAAATATTCCTCAAGGTGGCCGAAGAGAGTGGGGTGGATGCTGAG 


3720 




lllllllllllllllllllll'IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIM 




Db 


3661 


GAGACGACCCTGGAAGAAATATTCCTCAAGGTGGCCGAAGAGAGTGGGGTGGATGCTGAG 


3720 


Qy 


3721 


ACCTCAGATGGTACCTTGCCAGCAAGACGAAACAGGCGGGCCTTCGGGGACAAGCAGAGC 


3780 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMMIII 




Db 


3721 


ACCTCAGATGGTACCTTGCCAGCAAGACGAAACAGGCGGGCCTTCGGGGACAAGCAGAGC 


3780 


Qy 


3781 


TGTCTTCGCCCGTTCACTGAAGATGATGCTGCTGATCCAAATGATTCTGACATAGACCCA 


3840 




IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIII 




Db 


3781 


TGTCTTCGCCCGTTCACTGAAGATGATGCTGCTGATCCAAATGATTCTGACATAGACCCA 


3840 


Qy 


3841 


GAATCCAGAGAGACAGACTTGCTCAGTGGGATGGATGGCAAAGGGTCCTACCAGGTGAAA 


3900 




IMIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIMIIIIIMIMIII 




Db 


3841 


GAATCCAGAGAGACAGACTTGCTCAGTGGGATGGATGGCAAAGGGTCCTACCAGGTGAAA 


3900 


Qy 


3901 


GGCTGGAAACTTACACAGCAACAGTTTGTGGCCCTTTTGTGGAAGAGACTGCTAATTGCC 


3960 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


3901 


GGCTGGAAACTTACA CAGCAA CAGTTTGTGGCCCTTTTGTGGAAGAGACTGCTAATTGCC 


3960 


Qy 


3961 


AGACGGAGTCGGAAAGGArrriTrGCTCAGATTGTCTTGCCAGCTGTGTTTGTCTGCATT 


4020 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIM 




Db 


3961 


AGACGGAGTCGGAAAGGA'l'i''i"l"i"i'GCTCAGATTGTCTTGCCAGCTGTGTTTGTCTGCATT 


4020 


Qy 


4021 


GCCCTTGTGTTCAGCCTGATCGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 


4080 




IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


4021 


GCCCTTGTGTTCAGCCTGATCGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 


4080 


Qy 


4081 


CCCTGGATGTACAACGAACAGTACACATTTGTCAGCAATGATGCTCCTGAGGACACGGGA 


4140 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 




Db 


4081 


CCCTGGATGTACAACGAACAGTACACATTTGTCAGCAATGATGCTCCTGAGGA CA CGGGA 


4140 


Qy 


4141 


ACCCTGGAA CTCTTAAACGCC CTCA C CA AAGA CCCTGGCTTCGGGACCCGCTGTATGGAA 


4200 




IIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


4141 


ACCCTGGAACTCTTAAACGCCCTCACCAAAGACCCTGGCTTCGGGACCCGCTGTATGGAA 


4200 


Qy 


4201 


GGAAACCCAATCCCAGACACGCCCTGCCAGGCAGGGGAGGAAGAGTGGACCACTGCCCCA 


4260 




IIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIMIIMIII 




Db 


4201 


GGAAACCCAATCCCAGACACGCCCTGCCAGGCAGGGGAGGAAGAGTGGACCACTGCCCCA 


4260 


Qy 


4261 


GTTCCCCAGACCATCATGGACCTCTTCCAGAATGGGAACTGGACAATGCAGAACCCTTCA 


4320 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


4261 


GTTCCCCAGACCATCATGGACCTCTTCCAGAATGGGAACTGGACAATGCAGAACCCTTCA 


4320 


Qy 


4321 


CCTGCATGCCAGTGTAGCAGCGACAAAATCAAGAAGATGCTGCCTGTGTGTCCCCCAGGG 


4380 




iiiiiiiiiiliiMiiiiiiiiiilllllliiillllilllMIIIIIIIIIIIIIIII 




Db 


4321 


CCTGCATGCCAGTGTAGCAGCGACAAAATCAAGAAGATGCTGCCTGTGTGTCCCCCAGGG 


4380 


Qy 


4381 


GCAGGGGGGCTGCCTCCTCCACAAAGAAAACAAAACACTGCAGATATCCTTCAGGACCTG 


4440 




IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIMIIIIII 




Db 


4381 


GCAGGGGGGCTGCCTCCTCCACAAAGAAAACAAAACACTGCAGATATCCTTCAGGACCTG 


4440 


Qy 


4441 


ACAGGAAGAAACATTTCGGATTATCTGGTGAAGACGTATGTGCAGATCATAGCCAAAAGC 


4500 



Illlllllllllllllllllllllllllilllllllllllllllllllllllllllllll 



Db 


4441 


ACAGGAAGAAACATTTCGGATTATCTGGTGAAGACGTATGTGCAGATCATAGCCAAAAGC 


4 500 


Qy 


4501 


TTAAAGAACAAGATCTGGGTGAATGAGTTTAGGTATGGCGGCTTTTCCCTGGGTGTCAGT 


4560 




IIMIIIIIIIIIIIIIIIIIIIIIIIilllllllllllllllllilllMIIIIIIIII 




Db 


4501 


TTAAAGAACAAGATCTGGGTGAATGAGTTTAGGTATGGCGGCTTTTCCCTGGGTGTCAGT 


4560 


Qy 


4561 


AATACTCAAGCACTTCCTCCGAGTCAAGAAGTTAATGATGCCACCAAACAAATGAAGAAA 


4620 




IIIIIIIIIIIIIIIMIIIIIIIillllllMIIIIIIIIIIIIIIMIIIIIIIIIII 




Db 


4561 


AATACTCAAGCACITCCTCCGAGTCAAGAAGTTAATGATGCCACCAAACAAATGAAGAAA 


4 62 0 


Qy 


4621 


CACCTAAAGCTGGCCAAGGACAGTTCTGCAGATCGATTTCTCAACAGCTTGGGAAGATTT 


4680 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


4621 


CACCTAAAGCTGGCCAAGGACAGTTCTGCAGATCGATTTCTCAACAGCTTGGGAAGATTT 


4 680 


Qy 


4681 


ATGACAGGACTGGACACCAGAAATAATGTCAAGGTGTGGTTCAATAACAAGGGCTGGCAT 


4740 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIII 




Db 


4681 


ATGACAGGACTGGACACCAGAAATAATGTCAAGGTGTGGTTCAATAACAAGGGCTGGCAT 


474 0 


Qy 


4741 


GCAATCAGCTCTTTC CTGAATGTCATCAACAATGCCATTCTCCGGGCCAA CCTGCAAAAG 


4800 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


4741 


GCA ATC AGCTCTTTC CTGAATGTC ATC A A C AATGCC ATTCTCCGGGC C AA C CTGC A A A AG 


4800 


Qy 


4801 


GGAGAGAACCCTAGCCATTATGGAATTACTGCITTCAATCATCCCCTGAATCTCACCAAG 


4860 




iiiiiiiiiiiiiiiniiiiiiiiiiiiiMiiiiiiiiiiiiiiiiiiiiiiiiiiii 




Db 


4801 


GGAGAGAACCCTAGCCATTATGGAATTACTGCTTTCAATCATCCCCTGAATCTCACCAAG 


4860 


Qy 


4861 


CAGCAGCTCTCAGAGGTGGCTCCGATGACCACATCAGTGGATGTCCTTGTGTCCATCTGT 


4920 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 




Db 


4861 


CAGCAGCTCTCAGAGGTGGCTCCGATGACCACATCAGTGGATGTCCTTGTGTCCATCTGT 


4 92 0 


Qy 


4921 


GTCATCTTTGCAATGTCCTTCGTCCCAGCCAGCTTTGTCGTATTCCTGATCCAGGAGCGG 


4980 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


4921 


GTCATCTTTGCAATGTCCTTCGTCCCAGCCAGCTTTGTCGTATTCCTGATCCAGGAGCGG 


4980 


Qy 


4981 


GTCAGCAAAGCAAAACACCTGCAGTTCATCAGTGGAGTGAAGCCTGTCATCTACTGGCTC 


5 04 0 




IIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


4981 


GTCAGCAAAGCAAAACACCTGCAGTTCATCAGTGGAGTGAAGCCTGTCATCTACTGGCTC 


5040 


Qy 


5041 


TCTAATTTTGTCTGGGATATGTGCA ATTA CGTTGTCCCTGC CA CACTGGTC ATTATCATC 


5100 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


5041 


TCTAATTTTGTCTGGGATATGTGCA ATTA CGTTGTCCCTGC CA CACTGGTCATTATC ATC 


5100 


Qy 


5101 


TTCATCTGCTTCCAGCAGAAGTCCTATGTGTCCTCCACCAATCTGCCTGTGCTAGCCCTT 


5160 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


5101 


TTCATCTGCTTCCAGCAGAAGTCCTATGTGTCCTCCACCAATCTGCCTGTGCTAGCCCTT 


5160 


Qy 


5161 


CTACTTTTGCTGTATGGGTGGTCAATCACACCTCTCATGTACCCAGCCTCCTTTGTGTTC 


5220 




IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIi 




Db 


5161 


CTACTTTTGCTGTATGGGTGGTCAATCACACCTCTCATGTACCCAGCCTCCTTTGTGTTC 


522 0 


Qy 


5221 


AAGATCCCCAGC A CAGCCTATGTGGTGCTCA CCAGCGTGAA CCTCTTCATTGGCATTAAT 


5280 




lllllllllllllllllllllllllllllllllllllllllllllllllllllllillll 




Db 


5221 


AAGATCCCCAGCACAGCCTATGTGGTGCTCACCAGCGTGAACCTCTTCATTGGCATTAAT 


5280 


Qy 


5281 


GGCAGCGTGGCCACCTTTGTGCTGGAGCTGTTCACCGACAATAAGCTGAATAATATCAAT 


5340 




IIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


5281 


GGCAGCGTGGCCACCTTTGTGCTGGAGCTGTTCACCGACAATAAGCTGAATAATATCAAT 


5340 


Qy 


5341 


GATATCCTGAAGTCCGTGTTCTTGATCTTCCCACATTTTTGCCTGGGACGAGGGCTCATC 


5400 




1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 i 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 




Db 


5341 


GATATCCTGAAGTCCGTGTTCTTGATCTTCCCACATTTTTGCCTGGGACGAGGGCTCATC 


5400 


Qy 


5401 


GACATGGTGAAAAACCAGGCAATGGCTGATGCCCTGGAAAGGTTTGGGGAGAATCGCTTT 


5460 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


5401 


GACATGGTGAAAAACCAGGCAATGGCTGATGCCCTGGAAAGGTTTGGGGAGAATCGCTTT 


5460 


Qy 


5461 


GTGTCACCATTATCTTGGGACTTGGTGGGACGAAACCTCTTCGCCATGGCCGTGGAAGGG 


5520 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


5461 


GTGTCACCATTATCTTGGGACTTGGTGGGACGAAACCTCTTCGCCATGGCCGTGGAAGGG 


5520 



Qy 


5521 


GTGGTGTTCTTCCTCATTACTGTTCTGATCCAGTACAGATTCTTCATCAGGCCCAGACCT 


5580 




illlllllllllllllllllllllllllllllllllllllllllllllllllllllllM 




Db 


5521 


GTGGTGTTCITCCrCATTACTGTTCTGATCCAGTACAGATTCITCATCA^^ 


5580 


Qy 


5581 


GTAAATGCAAAGCTATCTCCTCTGAATGATGAAGATGAAGATGTGAGGCGGGAAAGACAG 


5640 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIMI 




Db 


5581 


GTAAATGCAAAGCTATCTCCTCTGAATGATGAAGATGAAGATGTGAGGCGGGAAAGACAG 


5640 


Qy 


5641 


AGAATTCTTGATGGTGGAGGCCAGAATGACATCTTAGAAATCAAGGAGTTGACGAAGATA 


5700 




IIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 




Db 


5641 


AGAATTCTTGATGGTGGAGGCCAGAATGACATCTTAGAAATCAAGGAGTTGACGAAGATA 


5700 


Qy 


5701 


TATAGAAGGAAGCGGAAGCCTGCTGTTGACAGGATTTGCGTGGGCATTCCTCCTGGTGAG 


5760 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMI 




Db 


5701 


TATAGAAGGAAGCGGAAGCCTGCTGTTGACAGGATTTGCGTGGGCATTCCTCCTGGTGAG 


5760 


Qy 


5761 


TGCTTTGGGCTCCTGGGAGTTAATGGGGCTGGAAAATCATCAACTTTCAAGATGTTAACA 


5820 




IIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIMIIIM 




Db 


5761 


TGCTTTGGGCTCCTGGGAGTTAATGGGGCTGGAAAATCATCAACTTTCAAGATGTTAACA 


5820 


Qy 


5821 


GGAGATACCACTGTTACCAGAGGAGATGCTTTCCTTAACAGAAATAGTATCTTATCAAAC 


5880 




IMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


5821 


GGAGATA CCACTGTTACCAGAGGAGATGCTTTCCTTAA CAGAAATAGTATCTTATCAAAC 


5880 


Qy 


5881 


ATCC ATGAAGTAC ATCAGAA CATGGGCTA CTGCCCTCAGTTTGATGCCATCACAGAGCTG 


5940 




IIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 




Db 


5881 


ATCC ATGAAGTAC ATCAGAA CATGGGCTA CTGCCCTCAGTTTGATGCCATCA CAGAGCTG 


5940 


Qy 


5941 


TTGACTGGGAGAGAA C ACGTGGAGTTCnTGCCCTTTTGAGAGGAGTCC CAGAGAA AGAA 


6000 




IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIII 




Db 


5941 


TTGACTGGGAGAGAACACGTGGAGTTCTTTGCCCTTTTGAGAGGAGTCCCAGAGAAAGAA 


6000 


Qy 


6001 


GTTGGCAAGGTTGGTGAGTGGGCGATTCGGAAACTGGGCCTCGTGAAGTATGGAGAAAAA 


6060 




IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


6001 


GTTGGCAAGGTTGGTGAGTGGGCGATTCGGAAACTGGGCCTCGTGAAGTATGGAGAAAAA 


6060 


Qy 


6061 


TATGCTGGTAACTATAGTGGAGGCAACAAACGCAAGCTCTCTACAGCCATGGCTTTGATC 


6120 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


6061 


TATGCTGGTAACTATAGTGGAGGCAACAAACGCAAGCTCTCTACAGCCATGGCTTTGATC 


6120 


Qy 


6121 


GGCGGGCCTCCTGTGGTGTTTCTGGATGAACCCACCACAGGCATGGATCCCAAAGCCCGG 


6180 




IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


6121 


GGCGGGCCTCCTGTGGTGTTTCTGGATGAACCCACCACAGGCATGGATCCCAAAGCCCGG 


6180 


Qy 


6181 


CGGTTCTTGTGGAATTGTGCCCTAAGTGTTGTCAAGGAGGGGAGATCAGTAGTGCTTACA 


6240 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


6181 


CGGTTCTTGTGGAATTGTGCCCTAAGTGTTGTCAAGGAGGGGAGATCAGTAGTGCTTACA 


6240 


Qy 


6241 


TCTCATAGTATGGAAGAATGTGAAGCTCTTTGCACTAGGATGGCAATCATGGTCAATGGA 


6300 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


6241 


TCTCATAGTATGGAAGAATGTGAAGCTCTTTGCACTAGGATGGCAATCATGGTCAATGGA 


6300 


Qy 


6301 


AGGTTCAGGTGCCTTGGCAGTGTCCAGCATCTAAAAAATAGGTTTGGAGATGGTTATACA 


6360 




IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 




Db 


6301 


AGGTTCAGGTGCCTTGGCAGTGTCCAGCATCTAAAAAATAGGTTTGGAGATGGTTATACA 


6360 


Qy 


6361 


ATAGTTGTACGA ATAGCAGGGTCCA A CCCGGACCTGAAGCCTGTC CAGGATTTCTTTGGA 


6420 




IIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 




Db 


6361 


ATAGTTGTACGAATAGCAGGGTCCAACCCGGACCTGAAGCCTGTCCAGGATTTCTTTGGA 


6420 


Qy 


6421 


CTTGCATTTCCTGGAAGTGTTCCAAAAGAGAAACACCGGAACATGCTACAATACCAGCTT 


6480 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


6421 


CTTGCATTTCCTGGAAGTGTTCCAAAAGAGAAACACCGGAACATGCTACAATACCAGCTT 


6480 


Qy 


6481 


CCATCTTCATTATCTTCTCTGGCCAGGATATTCAGCATCCTCTCCCAGAGCAAAAAGCGA 


6540 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllli 




Db 


6481 


CCATCTTCATTATCTTCTCTGGCCAGGATAITCAGCATCCTCTCCCAGAGCAAAAAGCGA 


6540 


Qy 


6541 


CTCCACATAGAAGACTACTCTGTTTCTCAGA CAACACTTGA CCAAGTATTTGTGAACTTT 


6600 






IIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 




Db 


6541 


CTCCACATAGAAGACTACTCTGTTTCTCAGACAACACTTGACCAAGTATTTGTGAACTTT 


6600 



Qy 66 01 GCCAAGGACCAAAGTGATGATGACCACTTAAAAGACCTCTCATTACACAAAAACCAGACA 6660 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 6601 GCCAAGGACCAAAGTGATGATGACCACTTAAAAGACCTCTCATTACACAAAAACCAGACA 66 60 

Qy 6661 GTAGTGGACGTOK:AGTTCTCACATCTTTTCTACAGGATGAGAAAGTGAAAGAAAGCTAT 672 0 

IIMIIIIIIIIIIIIIIIIIMMIIIIIIIMIIIIIillliilllllllllllMM 

Db 6661 GTAGTGGACGTTGCAGTTCTCACATCTTTTCTACAGGATGAGAAAGTGAAAGAAAGCTAT 6720 

Qy 6721 GTATGAAGAATCCTGTTCATACGGGGTGGCTGAAAGTAAAGAGGGACTAGACTTTCCTTT 6780 

IIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII lllllllllinill 

Db 6721 GTATGAAGAATCCTGTTCATACGGGGTGGCTGAAAGTAAAGAGGNACTAGACTTTCCTTT 67 80 

Qy 6 781 GCACCATGTGAAGTGTTGTGGAGAAAAGAGCCAGAAGTTGATGTGGGAAGAAGTAAACTG 6840 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 6781 GCACCATGTGAAGTGTTGTGGAGAAAAGAGCCAGAAGTTGATGTGGGAAGAAGTAAACTG 6840 

Qy 6841 GATACTGTACTGATACTATTCAATGCAATGCAATTCAATG 6880 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 6841 GATACTGTACTGATACTATTCAATGCAATGCAATTCAATG 688 0 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


6880 


100 . 


0 


6880 


21 


AAZ94734 


Human ATP binding 


2 


6880 


100. 


0 


6880 


22 


AAI70314 


Human ATP binding 


3 


6880 


100. 


0 


7260 


22 


AAD21326 


Human ATP binding 


4 


6880 


100. 


0 


7260 


22 


AAI70315 


Human ATP binding 


5 


6879 


100. 


0 


9497 


24 


ABV78 02 0 


Hypoxia- regulated 


6 


6876 . 


.8 


100. 


0 


6880 


22 


AAD2132 5 


Human ATP binding 


7 


6875 , 


.8 


99, 


9 


9741 


22 


AAS06120 


Human ABCl DNA seq 


8 


6875 , 


.8 


99 . 


9 


9741 


24 


AAD37273 


Human ABCl full-le 


9 


6875 , 


.8 


99 . 


9 


9854 


22 


AAS06121 


Human ABCl DNA seq 


10 


6875 . 


.8 


99 . 


9 


9870 


24 


ABN993 01 


Polymorphic 


human 


11 


6875 , 


.8 


99. 


9 


9870 


24 


ABN9 93 02 


Polymorphic 


human 


12 


6875 , 


.8 


99. 


9 


9870 


24 


ABN99303 


Polymorphic 


human 


13 


6875 , 


.8 


99. 


.9 


9870 


24 


ABN99324 


Polymorphic 


human 


14 


6875, 


.8 


99. 


,9 


9870 


24 


ABN99328 


Polymorphic 


human 


15 


6875, 


.8 


99. 


,9 


9870 


24 


ABN9932 9 


Polymorphic 


human 


16 


6875, 


.8 


99. 


,9 


9870 


24 


ABN99330 


Polymorphic 


human 


17 


6875, 


.8 


99. 


,9 


9870 


24 


ABN99331 


Polymorphic 


human 


18 


6875, 


.8 


99. 


,9 


9870 


24 


ABN99332 


Polymorphic 


human 


19 


6875, 


.8 


99. 


,9 


9870 


24 


ABN99333 


Polymorphic 


human 


20 


6875, 


.8 


99. 


.9 


9870 


24 


ABN99334 


Polymorphic 


human 


21 


6875 


.8 


99 . 


, 9 


9870 


24 


ABN99335 


Polymorphic 


human 


22 


6875 


.2 


99. 


.9 


7281 


22 


AAK51683 


Human polynucleoti 


23 


6874 


.2 


99. 


.9 


9741 


24 


ABL5 814 6 


Human ABCAl 


transp 


24 


6874 


.2 


99. 


.9 


9870 


24 


ABN9 93 04 


Polymorphic 


human 


25 


6874 


.2 


99. 


.9 


9870 


24 


ABN99305 


Polymorphic 


human 


26 


6874 


.2 


99 . 


.9 


9870 


24 


ABN99306 


Polymorphic 


human 


27 


6874 


.2 


99 , 


. 9 


9870 


24 


ABN993 07 


Polymorphic 


human 


28 


6874 


.2 


99, 


. 9 


9870 


24 


ABN99309 


Polymorphic 


human 


29 


6874 


.2 


99, 


.9 


9870 


24 


ABN99310 


Polymorphic 


human 


30 


6874 


.2 


99, 


.9 


9870 


24 


ABN99311 


Polymorphic 


human 


31 


6874 


.2 


99, 


.9 


9870 


24 


ABN99312 


Polymorphic 


human 


32 


6874 


.2 


99, 


.9 


9870 


24 


ABN99313 


Polymorphic 


human 


33 


6874 


.2 


99 


.9 


9870 


24 


ABN99314 


Polymorphic 


human 


34 


6874 


.2 


99 


.9 


9870 


24 


ABN9 9315 


Polymorphic 


human 


35 


6874 


.2 


99 


.9 


9870 


24 


ABN99316 


Polymorphic 


human 


36 


6874 


.2 


99 


.9 


9870 


24 


ABN99317 


Pol^'mcrphic 


human 


37 


6874 


.2 


99 


.9 


9870 


24 


ABN99318 


Polymorphic 


human 


38 


6874 


.2 


99 


.9 


9870 


24 


ABN99319 


Polymorphic 


human 


39 


6874 


.2 


99 


. 9 


9870 


24 


ABN9932 0 


Polymorphic 


human 


40 


6874 


.2 


99 


.9 


9870 


24 


ABN99321 


Polymorphic 


human 


41 


6874 


.2 


99 


.9 


9870 


24 


ABN99322 


Polymorphic 


human 


42 


6874 


.2 


99 


.9 


9870 


24 


ABN99323 


Polymorphic 


human 


43 


6874 


.2 


99 


.9 


9870 


24 


ABN99325 


Polymorphic 


human 


44 


6874 


.2 


99 


.9 


9870 


24 


ABN99326 


Polymorphic 


human 



45 6874.2 



99.9 9870 24 ABN9932 7 



Polymorphic human 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 





1 


430.8 


6 . 


.3 


5894 * 


3 


US-08-665-259-24 


Sequence 


24, Appl 




2 


430.8 


6. 


.3 


5894 


3 


US-08-762-500-24 


Sequence 


24, Appl 




3 


430.8 


6. 


.3 


6525 


3 


US-08-762-500-74 


Sequence 


74, Appl 




4 


99.4 


1 , 


.4 


1008 


4 


US-09-252-991A-42 66 


Sequence 


4266, Ap 


c 


5 


98 .8 


1, 


.4 


999 


4 


US-09-2 52-991A-4588 


Sequence 


4588, Ap 




6 


89.4 


1, 


.3 


1614 


4 


US-09-252-991A-4198 


Sequence 


4198, Ap 




7 


85 .8 


1, 


.2 


1824 


4 


US-09-2 52-991A-1780 


Sequence 


1780, Ap 




8 


83 .8 


1, 


.2 


1176 


4 


US-09-2 52-991A-3704 


Sequence 


3704, Ap 


c 


9 


83 .8 


1, 


.2 


2253 


4 


US-09-2 52-991A-3810 


Sequence 


3810, Ap 




10 


81 


1. 


.2 


36181 


4 


US-08-311-731A-12 0 


Sequence 


12 0, App 




11 


78.4 


1, 


.1 


1548 


4 


US-09-252-991A-3653 


Sequence 


3653, Ap 


c 


12 


77.4 


1, 


.1 


762 


4 


US-09-252-991A-1938 


Sequence 


1938, Ap 


c 


13 


76.6 


1. 


.1 


4411529 


3 


US-09-103-840A-1 


Sequence 1, Appli 




14 


75.2 


1. 


.1 


969 


4 


US-09-107-532A-1008 


Sequence 


1008, Ap 




15 


75 


1 


.1 


1770 


4 


US-09-328-352-3466 


Sequence 


3466, Ap 



Result 
No. 



Score 



% 

Query 

Match Length DB 



SUMMARIES 



ID 



Description 



c 
c 



1 


3600 


.4 


52 , 


.3 


4783 


11 


AK051920 


AK051920 


Mus muscu 


2 


1024 , 


.8 


14, 


.9 


3605 


11 


AK052916 


AK052916 


Mus muscu 


3 


692 , 


.4 


10, 


.1 


809 


12 


BI754756 


BI754756 


603025477 


4 


636 , 


.6 


9, 


.3 


854 


12 


BI854140 


BI854140 


603381449 


5 


624 . 


.4 


9, 


.1 


754 


14 


CD351847 


CD351847 


UI-M-GIO- 


6 


591, 


.2 


8, 


.6 


878 


12 


BI182779 


BI182779 


UNL-P-FN- 


7 


587 , 


.8 


8, 


.5 


673 


14 


CD351490 


CD351490 


UI-M-GIO- 


8 


550, 


.4 


8 , 


.0 


661 


14 


CB481810 


CB481810 


jns83_A02 


9 


547 , 


.6 


8 , 


.0 


773 


14 


CB526974 


CB526974 


UI-M-FYO- 


10 


542 


7 


.9 


837 


10 


BF160011 


BF160011 


601768192 


11 


536 


.8 


7 


.8 


652 


10 


BB468374 


6B468374 


BB468374 


12 


535 


.2 


7 


.8 


702 


12 


BI658600 


BI658600 


603284335 


13 


532 


.4 


7 , 


.7 


724 


12 


BG920223 


BG92 0223 


602822304 


14 


528 


.4 


7 


.7 


639 


14 


CB477340 


CB477340 


jns2 0_Cll 


15 


525 


7 


.6 


720 


12 


BI248317 


BI248317 


602966918 


16 


516 


7 


.5 


620 


14 


CA889361 


CA889361 


B0152D10- 


17 


510. 


.4 


7 


.4 


512 


13 


BX279856 


BX279856 


BX279856 


18 


505 


7 


.3 


614 


14 


CB478239 


CB478239 


jns32_B02 


19 


486 


.4 


7 


.1 


605 


14 


CB546553 


CB546553 


AMGNNUC:N 


20 


479 


.8 


7 


. 0 


579 


14 


CA884367 


CA884367 


B0109G01- 


21 


477 


6 , 


.9 


1004 


11 


BC034824 


BC034824 


Homo sapi 


22 


473 


6 


.9 


562 


10 


BE665489 


BE665489 


154542 MA 


23 


459 


.4 


6 


.7 


515 


10 


BF094524 


BF094524 


MR0-UT004 


24 


456, 


.8 


6 


.6 


605 


14 


CB581583 


CB581583 


AMGNNUC:N 


25 


447 


.6 


6 


.5 


554 


10 


BF076322 


BF076322 


225856 MA 


26 


438 


.4 


6 


.4 


512 


14 


CB717058 


CB717058 


AMGNNUC : N 


27 


435 


.6 


6 


.3 


884 


13 


BU906084 


BU906084 


AGENCOURT 



RESULT 1 
AK051920 
LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 



AK051920 4783 bp mRNA linear HTC 05-DEC-2002 

Mus mus cuius 12 days embryo eyeball cDNA, RIKEN full-length 
enriched library, clone : D230019D04 product : ATP -binding cassette, 
sub-family A (ABCl) , member 1, full insert sequence. 
AK051920 

AK051920.1 GI :26342297 

HTC; CAP trapper. 

Mus musculus (house mouse) 



ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AXTTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 

MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 
AUTHORS 



Mus musculus 

Eukaryota; Metazoa; Chordata; Creiniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 

Carninci,P. aind Hayashizaki , Y . 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Caminci,P., Shibata^Y., Hayatsu,N., Sugahara^Y., Shibata,K., 

Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y. 

Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 

Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

3 

Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki, N., Caminci,P., 
Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M., 
Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A., 
Yamamoto,R., Mat sumo t o, H . , Sakaguchi , S . , Ikegami,T., Kashiwagi , K . , 
Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M., 
Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., 
Okazaki,Y., Muramat su , M . , Inoue,Y., Kira,A. and Hayashizaki,Y. 
RIKEN integrated sequence analysis (RISA) system- -384 -format 
sequencing pipeline with 384 multicapillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
20530913 



11076861 
4 

Kawai , J . , 
Arakawa,T 
Aizawa, K. 
Saito, T. , 
Kadota,K. 



, Ishii,Y., 
Fukuda , S . , 
Kondo , S . , Yamanaka , I . ^ 
Kasukawa,T., Saito, R., 



Shinagawa , A . , Shibata,K., Yoshino,M., Itoh,M. 
, Hara,A., Fukunishi, Y. , Konnb,H., Adachi,J., 

Izawa,M., Nishi,K., Kiyosawa,H. 
Okazaki,Y., Gojobori,T., Bono,H. 
Matsuda,H., Ashburner, M . , Batalov,S., Casavant,T., 
Flei3chmann,W. , Gaasterland, T . , Gissi,C., King,B., Kochiwa,H., 
Kuehl,P., Lewis, S., Matsuo,Y., Nikaido,!., Pesole,G., 
Quackenbush, J . , Schriml , L .M . , Staubli,F., Suzuki, R., Tomita,M., 
Wagner, L,, Washio,T., S£Lkai,K., Okido,T., Furuno,M., Aono,H., 
Baldarelli,R. , Barsh,G., Blake, J., Boffelli,D., Bojunga,N., 
Carninci,P., de Bonaldo, M . F . , Browns t ein, M , J . , Bult,C., 
Fletcher, C, Fujita,M., Gariboldi , M . , Gustincich, S . , Hill,D., 
Hofmann,M., Hume, D. A., Kamiya,M., Lee,N.H., Lyons, P., 
Marchionni , L . , Mashima,J., Mazzarelli , J . , Mombaerts , P . , Nordone,P., 
Ring,B., Ringwald,M., Rodriguez , I . , Sakamoto, N., Sasaki, H., 
Sato,K., Schonbach,C., Seya,T., Shibata,Y., S torch, K.F., Suzuki, H., 
Toyo-oka,K., Wang,K.H., Weitz,C., Whittaker , C . , Wilming,L., 
Wynshaw-Boris,A. , Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S. 
and Hayashizaki, Y . 

Functional annotation of a full-length mouse cDNA collection 

Nature 409 (6821), 685-690 (2001) 

21085660 

11217851 

5 

The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I fit II Team. 

Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
6 (bases 1 to 4783) 

Adachi,J., Aizawa, K., Akimura,T., Arakawa,T., Bono,H., Carninci,P., 
Fukuda, S., Furuno,M., Hanagaki,T., Hara.A., Hashizume , W . , 
Hayashida, K. , Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T., 
Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,!., Kasukawa,T., 
Katoh,H., Kawai, J., Kojima,Y., Kondo, S., Konno,H., Kouda,M., 
Koya,S., Kurihara,C., Matsuyama, T , , Miyazaki,A., Murata,M., 
Nakamura,M., Nishi,K., Nomura, K., Numazaki,R., Ohno,M., Ohsato,N., 
Okazaki,Y., Saito, R., Saitoh, H., Sakai,C., Sakai,K., Sakazume,N., 
Sano,H., Sasaki, D., Shibata,K., Shinagawa, A . , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A., Taksihashi , F . , Takaku-Akahira, S . , 



Takeda,Y., Tanaka,T.. Tomaru,A., Toya,T., Yasxmishi , A . , 
Muramatsu,M. and Hayashizaki , Y . 
TITLE Direct Submission 

JOURNAL Submitted {16-JUL-2 001) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 2 30-0045, Japan (E-mail : genome- res@gsc . riken .go. jp, 
URL :http: //genome. gsc .riken.go.jp/, Tel : 81-4 5 -503 -9222 , 
Fax:81-45-503-9216) 
COMMENT cDNA library was prepared and sequenced in Mouse Genome 

Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laiboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues . 

Please visit our web site for further details. 
URL:http: //genome .gsc .riken.go.jp/ 
URL :http : //fantom.gsc . riken.go.jp/ . 
FEATURES Location/Qualifiers 
source 1. .4783 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strains '•C57BL/6J" 

/db_xre f = " FANTOM_DB : D2 3 0 0 1 9D04 

/ db_x r e f " t axon : 1 0 0 9 0 " 

/clone="D230019D04" 

/tissue_type=" eyeball" 

/clone_lib= "RIKEN full-length enriched mouse cDNA library" 
/dev_stage="12 days embryo" 
CDS 322. .>4782 

/notes "unnamed protein product; ATP-binding cassette, 

sub-family A (ABCl) , member 1 (MGD | MGI : 99607 , 

GB|NM_013454, evidence: BLASTN, 99%, match=4545) 

putative " 

/codon_start=l 

/protein_id=^"BAC34811 . 1" 

/db_xref="GI : 263422 98" 

/translation="MACWPQLRLLLWKNLTFRRRQTCQLLLEVAWPLFIFLILISVRL 
SYPPYEQHECHFPNKAMPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSIV 
SRLFSDAQRLLLYSQRDTS I KDMHKVLRMLRQ I KHPNSNLKLQDFLVDNETFSGFLQH 
NLSLPRSTVDSLLQANVGLQKVFLQGYQLHLASLCNGSKLEE I IQLGDAEVSALCGLP 
RKKLDAAERVLRYNMD I LKPWTKLNSTSHLPTQHLAE ATTVLLDSLGGLAQE LFSTK 
S WSDMRQE VMFLTNVNS S S S STQ I YQAVS RI VCGHPEGGGLKI KSLNWYE DNNYKALF 
GGNNTEEDVDTFYDNSTTPYCNDLMKNLESSPLSRIIWKALKPLLVGKILYTPDTPAT 
RQVMAEVNKTFQELAVFHDLEGMWEELSPQIWTFMENSQEMDLVRTLLDSRGNDQFWE 
QKLDGLDWTAQDIMAFLAKNPEDAQSPNGSVYTWREAFNETNQAIQTISRFMECVNLN 
KLEPIPTEVRLINKSMELLDERKFWAGIVFTGITPDSVELPHHVKYKIRMDIDNVERT 
NKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDWEQAIIRVLTGSEKKTGVYVQQMPY 
PCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKSIVYEKEARLKETMRIMGLDNGILW 
FSWFVSSLIPLLVSAGLLWILKLGNLLPYSDPSWFVFLSVFAMVTILQCFLISTLF 
SRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFSIKIFASLLSPVAFGFGCEYFALFE 
EQGIGVQWDNLFESPVEEDGFNLTTAVSMMLFDTFLYGVMTWYIEAVFPGQYGIPRPW 
YFPCTKSYWFGEEIDEKSHPGSSQKGVSEICMEEEPTHLRLGVSIQNLVKVYRDGMKV 
AVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYILGKDIRSEMSSIR 
QNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVGLPPSKLKSKT 
SQLS GGMQRKLS VALA FVGGS KW I LDE PTAGVDP YSRRG I WELLLKYRQGRT 1 1 LST 
HHMDEADI LGDRI A 1 1 SHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLSS CRNSS 
STVS CL KKEDS VSQS S SDAGLGSDHE S DTLT I DVS A I SNL I RKHVSE ARLVED I GHEL 
TYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAETS 
DGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETDLLSGMDGKGSYQLK 
GWKLTQQQFVALLWKRLL I ARRSRKGFFAQ I VLPAVFVC I ALVFSL I VPPFGKYPSLE 
LQPWMYNEQ YTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNP I PDTPCLAGEEDW 
TI SPVPQS I VDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPPP " 

BASE COUNT 1119 a 1236 c 1310 g 1118 t 

ORIGIN 



Query Match 52.3%; Score 3 600.4; DB 11; Length 478 3; 

Best Local Similarity 88.6%; Pred. No. 0; 

Matches 3901; Conservative 0; Mismatches 501; Indels 0; Gaps 0; 



Qy 



1 CAAACATGTCAGCTGTTACTGGAAGTGGCCTGGCCTCTATTTATCTTCCTGATCCTGATC 6 0 



Illlllllllllllllllllllllllllllllllllll lllllllllllllllllllll 



Db 


382 


C AAA CATGTC AGCTGTTACTGGAAGTGGCCTGGCCTCTC 1 1 i ATCT^ C IXarA i 1\jA l v_ 


441 


Qy 


61 


TCTGTTCGGCTGAGCTACCCACCCTATGAACAACATGAATGCCATTTTCCAAATAAAGCC 


12 0 




Mill II lllllllllllllllll lllllllllll Mill mil IMIIilll 




Db 


442 


TCTGTACGCCrcAGCTACCCACCCTACGAACAACATGAGTGCCACTT.T. t-t_tjAA 1 AAAtrt-i, 


501 


Qy 


121 


ATGCCCTCTGCAGGAACACTTCCTTGGGTTCAGGGGATTATCTGTAATGCCAACAACCCC 


180 




Mill IIIIMIIIII II II MMI IIIIIIIIIIMIIIIMIMMIIIIMI 




Db 


502 


ATGC CGTCTGC A GGAA C C CTC C CCTGGGTA C A GGGGATTATCTGTA ATGC C AA C AA C C CC 




Qy 


181 


TGTTTCCGTTACCCGACTCCTGGGGAGGCTCCCGGAGTTGTTGGAAACTTTAACAAATCC 


240 




II iiiniii II iiiii II lllllllllll iiiiiiiiiiiiiiiiiMi III 




Db 


562 


TGCTTC CGTTATCCAA CTC CCGGCGAGGCTC C CGGTGTTGTTGGAA A CTTTAA C A AGTC C 


62 1 


Qy 


241 


ATTGTGGCTCGCCTGTTCTCAGATGCTCGGAGGCTTCTTTTATACAGCCAGAAAGACACC 


300 




II III III III III II III II MM IIIII Mill 1 MM MM 1 III III 




Db 


622 


ATCGTGTCTCGCCTGTTCTCAGA CGCTC AGAGGCTTCTTCTGTA C AGC C A AAGAGATA C C 


cm 
box 


Qy 


301 


AGCATGAAGGACATGCGCAAAGTTCTGAGAACATTACAGCAGATCAAGAAATCCAGCTCA 


360 




Mill IMIMMII Ml M IMMII MM MIIMMII 1 Ml MM 




Db 


682 


A GC ATTA AGGA C ATGC A C AAGGTC CTGAGAATGTTA CGGC AGATC A AGC ATC C C A A CTC A 




Qy 


361 


AACTTGAAGCTTCAAGATTTCCTGGTGGACAATGAAACCTTCTCTGGGTTCCTGTATCAC 


420 




M IIMIMI II Mill IIMIIIMIMIIIII IMIMM MMI! 1 Ml 




Db 


742 


A ATTTGAA GCTCC AGGATTTTCTGGTGGA C AATGA A A C ATTCTCTGGATTC CTGC AGC A C 


8 01 


Qy 


421 


AACCTCTCTCTCCCAAAGTCTACTGTGGACAAGATGCTGAGGGCTGATGTCATTCTCCAC 


480 




II 1 II II MM lllllllllllll IIIII III Mil MMII 




Db 


802 


A ATTTGTC C CTTC C A AGATCT ACTGTGGA C A GC CTGCTGC AGGCGA ATGTTGGTCTC C AG 


861 


Qy 


481 


AAGGTATTTTTGCAAGGCTACCAGTTACATTTGACAAGTCTGTGCAATGGATCAAAATCA 


540 




IMMMMIMIMIIIIMM IIIIIIIM 1 llllllll II IMIIIIIII 1 




Db 


862 


A AGGTATTTTTGC A A GGCTA CC A A TT A C ATTTGGC C AGTCTGTGTA A CGGA TC A A A ATTA 


921 


Qy 


541 


GAAGAGATGATTCAACTTGGTGACCAAGAAGTTTCTGAGCTTTGTGGCCTACCAAGGGAG 


600 




Mill M MMI MIIIIM IMIMMII II Mill Mill III M 




Db 


92 2 


GAA GA AATTATTC AGCTTGGTGA TGCGGAAGTTTCTGC C CTCTGTGGTCTA C CGAGGA AG 


981 


Qy 


601 


AAACTGGCTGCAGCAGAGCGAGTACTTCGTTCCAACATGGACATCCTGAAGCCAATCCTG 


660 




II II 1 IMMI III lllllll II 1 IIIIIIIIMIIIMIIIIIM 1 II 




Db 


982 


AAGCTCGATGC AGCCGAGAGAGTACTGCGCTA CAA CATGGA CATCCTGAAGCCAGTTGTG 


1041 


Qy 


661 


AGAACACTAAACTCTACATCTCCCTTCCCGAGCAAGGAGCTGGCCGAAGCCACAAAAACA 


720 




1 II IMMI II lllllll MUM 1 II 1 IIIII IIMIMI 1 1 




Db 


1042 


ACAAAACTAAATTCCACATCTCATCTCCCGACCCAGCATCTGGCTGAAGCCACCACAGTG 


1101 


Qy 


721 


TTGCTGCATAGTCTTGGGACTCTGGCCCAGGAGCTGTTCAGCATGAGAAGCTGGAGTGAC 


780 




IIIII 1 II 1 III llllllll lllllllllllll 1 IIMIMI III 




Db 


1102 


TTGCTTGA C AGCTTGGGGGGC CTGGC C C A AGAGCTGTTC AGC AC A A AGA GCTGGAGCGA C 


1161 


Qy 


781 


ATGCGACAGGAGGTGATGTTTCTGACCAATGTGAACAGCTCCAGCTCCTCCACCCAAATC 


840 




Mill IIIIIIIIIIIIMIIMIIIM IIIIIIIIIIMIIIIIIIIIIIIII III 




Db 


1162 


ATGCGGCAGGAGGTGATGTTTCTGACCAACGTGAACAGCTCCAGCTCCTCCACCCAGATC 


1221 


Qy 


841 


TACCAGGCTGTGTCTCGTATTGTCTGCGGGCATCCCGAGGGAGGGGGGCTGAAGATCAAG 


900 




MIIIIM IIIII II II II II II II M IIIII Mill IIIIMIIIIII 




Db 


1222 


TACCAGGCAGTGTCCCGCATCGTGTGTGGTCACCCAGAGGGTGGGGGCCTGAAGATCAAG 


1281 


Qy 


901 


TCTCTC AA CTGGTATGAGGA CAACAACTACAAAGCCCTCTTTGGAGGCAATGGC A CTGAG 


960 




M lllllllllll IIIII II II II 1 II 1 1 1 M 1 II II II II 1 1 1 Ml III III 
II lllllllllll IIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III III ill 




Db 


1282 


TCCCTCAACTGGTACGAGGATAACAACTACAAAGCCCTCTTTGGAGGGAATAACACCGAG 


1341 


Qy 


961 


GAAGATGCTGAAACCTTCTATGACAACTCTACAACTCCTTACTGCAATGATTTGATGAAG 


1020 




Mill 1 II IIIMIIIIMIII IIIIIIIMIIMI IIMIIIIIIIIIIIIM 




Db 


1342 


GAAGACGTGGACACCTTCTATGACAATTCTACAACTCCTTATTGCAATGATTTGATGAAG 


1401 


Qy 


1021 


AATTTGGAGTCTAGTCCTCTTTCCCGCATTATCTGGAAAGCTCTGAAGCCGCTGCTC6TT 


1080 




II lllillllllllllllllll II Mill IIIII II II IIIII IIIII Ml 




Db 


1402 


AACTTGGAGTCTAGTCCTCTTTCTCGAATTATTTGGAAGGCACTCAAGCCACTGCTTGTT 


1461 



Qy 


1081 


GGGAAGATCCTGTATACACCTGACACTCCAGCCACAAGGCAGGTCATGGCTGAGGTGAAC 


1140 




II Mill II lllllllilllMI Mill IIIIMMIIMIIMIIIMIIIIII 




Db 


1462 


GGAAAGATTCTCTATACACCTGACACACCAGCTACAAGGCAGGTCATGGCTGAGGTGAAC 


1521 


Qy 


1141 


AAGACCTTCCAGGAACTGGCTGTGTTCCATGATCTGGAAGGCATGTGGGAGGAACTCAGC 


1200 




MMMM MM! IIIIIIIIMIIIMI Mill IIIIIIIIIM IMIIIMI 




Db 


1522 


AAGACCTTTCAGGAGTTGGCTGTGTTCCATGACCTGGAGGGCATGTGGGAAGAACTCAGC 


1581 


Qy 


1201 


CCCAAGATCTGGACCTTCATGGAGAACAGCCAAGAAATGGACCTTGTCCGGATGCTGTTG 


1260 




Ml 1 II IMIIIIIIIIIIMIMIIIIIIII IIIIMMMIIIIM llllll 




Db 


1582 


CCCCAAATTTGGACCTTCATGGAGAACAGCCAAGAGATGGACCTTGTCCGGACGCTGTTA 


1641 


Qy 


1261 


GACAGCAGGGACAATGACCACTTTTGGGAACAGCAGTTGGATGGCTTAGATTGGACAGCC 


1320 




MMMM 1 IMIIIIII IIMIIIMMI MMIIIIM IIIMIIIIM Ml 




Db 


1642 


GACAGCAGAGGCAATGACCAGTTTTGGGAACAGAAGTTGGATGGATTAGATTGGACTGCC 


1701 


Qy 


1321 


CAAGACATCGTGGCGTTTTTGGCCAAGCACCCAGAGGATGTCCAGTCCAGTAATGGTTCT 


1380 




IMIIIIII MMMM MMMM lllllll MM IMIII Mill III 




Db 


1702 


CAAGACATCATGGCGTTTCTGGCCAAGAACCCAGAAGATGCTCAGTCCCCAAATGGCTCT 


1761 


Qy 


1381 


GTGTACACCTGGAGAGAAGCTTTCAACGAGACTAACCAGGCAATCCGGACCATATCTCGC 


1440 




MM! IMMIIIIIIIIIIMIII Mill IMIMIIMIII Ml MIMMI 




Db 


1762 


GTGTATACCTGGAGAGAAGCTTTCAATGAGACCAACCAGGCAATCCAGACGATATCTCGA 


1821 


Qy 


1441 


TTCATGGAGTGTGTCAACCTGAACAAGCTAGAACCCATAGCAACAGAAGTCTGGCTCATC 


1500 




IIIMIIIMIIIMIIIMIMIIMII MMMM 1 MMMM! MMMM 




Db 


1822 


TTCATGGAGTGTGTCAACCTGAACAAGCTGGAACCCATTCCGACAGAAGTCAGGCTCATC 


1881 


Qy 


1501 


AACAAGTCCATGGAGCTGCTGGATGAGAGGAAGTTCTGGGCTGGTATTGTGTTCACTGGA 


1560 




MIIIIMIIMIIIIIIIIIII IMIIMIIIIMIIIIIM M MMMM II 




Db 


1882 


AACAAGTCCATGGAGCTGCTGGACGAGAGGAAGTTCTGGGCTGGCATCGTGTTCACAGGC 


1941 


Qy 


1561 


ATTACTCCAGGCAGCATTGAGCTGCCCCATCATGTCAAGTACAAGATCCGAATGGACATT 


1620 




II lllllll II 1 IIIIIIIIIMIIIIII IIMMIIIIIMI IIIMIIII 




Db 


1942 


ATCACTCCAGATAGTGTGGAGCTGCCCCATCATGTAAAGTACAAGATCCGGATGGACATT 


2001 


Qy 


1621 


GACAATGTGGAGAGGACAAATAAAATCAAGGATGGGTACTGGGACCCTGGTCCTCGAGCT 


1680 




Mill MMMM II Mill IIIIIIIIIIMIIIMIIIMMIIIIIIII III 




Db 


2002 


GACAACGTGGAGAGAACTAATAAGATCAAGGATGGGTACTGGGACCCTGGTCCTCGGGCT 


2061 


Qy 


1681 


GACCCCTTTGAGGACATGCGGTACGTCTGGGGGGGCTTCGCCTACTTGCAGGATGTGGTG 


1740 




Mill MM! II Mill II MMMM M M II II II 1 II II II 1 II M 1 II II 




Db 


2062 


GACCCTTTTGAAGATATGCGCTATGTCTGGGGCGGCTTCGCCTACTTGCAGGATGTGGTG 


2121 


Qy 


1741 


GAGCAGGCAATCATCAGGGTGCTGACGGGCACCGAGAAGAAAACTGGTGTCTATATGCAA 


1800 




II Mill llllllll IIIIIIIIIM 1 IIIIIIIMM MMMM IIMI 




Db 


2122 


GAACAGGCCATCATCAGAGTGCTGACGGGATCTGAGAAGAAAA CGGGTGTCTA CGTGCAA 


2181 


Qy 


1801 


CAGATGCCCTATCCCTGTTACGTTGATGACATCTTTCTGCGGGTGATGAGCCGGTCAATG 


1860 




IIMIIIMII IIIIIIM MIIMIIIM IMIMIIIII IIIIIIMIIMIII 




Db 


2182 


CAGATGCCCTACCCCTGTTATGTTGATGACATTTTTCTGCGGGTCATGAGCCGGTCAATG 


2241 


Qy 


1861 


CCCCTCTTCATGACGCTGGCCTGGATTTACTCAGTGGCTGTGATCATCAAGGGCATCGTG 


1920 




llllllllllllll II MMMM IIMI II IIIIIIIMMIIM MM III 




Db 


2242 


CCCCTCTTCATGACTCTAGCCTGGATCTACTCTGTCGCTGTGATCATCAAGAGCATTGTG 


2301 


Qy 


1921 


TATGAGAAGGAGGCACGGCTGAAAGAGACCATGCGGATCATGGGCCTGGACAACAGCATC 


1980 




IIIIIIIIIIIIM llllllll IMIMIIIMMMIIIII MMMM Mill 




Db 


2302 


TATGAGAAGGAGGCTCGGCTGAAGGAGACCATGCGGATCATGGGTCTGGACAATGGCATC 


2361 


Qy 


1981 


CTCTGGTTTAGCTGGTTCATTAGTAGCCTCATTCCTCTTCTTGTGAGCGCTGGCCTGCTA 


2 04 0 




IIIIMIIIMIMIII MM MMMM Mill IIIIIIMIMMIIIIIII 

lllllllllllllllll 1111 llllllll lllll IIIIIIIIIIIIIIIIIIVI 




Db 


2362 


CTCTGGTTTAGCTGGTTTGTTAGCAGCCTCATCCCTCTGCTTGTGAGCGCTGGCCTGCTG 


2421 


Qy 


2041 


GTGGTCATCCTGAAGTTAGGAAACCTGCTGCCCTACAGTGATCCCAGCGTGGTGTTTGTC 


2100 




IIIIMMI IIIIIIIIIMIIIIIIIIIIIIII Mill IIIIIIIIIIMII III 




Db 


2422 


GTGGTCATCTTGAAGTTAGGAAACCTGCTGCCCTATAGTGACCCCAGCGTGGTGTTCGTC 


2481 


Qy 


2101 


TTCCTGTCCGTGTTOSCTGTGGTGACAATCCTGCAGTGCTTCCTGATTAGCACACTCTTC 


2160 




IIIIIIM IMMIII lllllll lllll IMIIIIIIM MMMM llllll 




Db 


2462 


TTCCTGTCTGTGTTTGCCATGGTGACCATCCTACAGTGCTTCCTCATTAGCACGCTCTTC 


2541 



Qy 


2161 


TCCAGAGCCAACCTGGCAGCAGCCTGTGGGGGCATCATCTACTTCACGCTGTACCTGCCC 


2220 




III 1 lllllllllllllllllllllllllllllllllllllllillllllllllilll 




Db 


2542 


TC C CGTGC C AAC CTGGC AGC AGC CTGTGGGGGC ATC ATCTA CTTC A CGCTGTA CCTGCCC 


'5 c rt T 


Qy 


2221 


TACGTCCTGTGTGTGGCATGGCAGGACTACGTGGGCTTCACACTCAAGATCTTCGCTAGC 


2280 




II II Mill II II lllllllllll lllllllll 1 llllllllll MINI 




Db 


2602 


TATGTGCTGTGCGTAGC CTGGC AGGA CTATGTGGGCTTCTC CATC AAGA TCTTTGCTAGC 


Zoo X 


Qy 


2281 


CTGCTGTCTCCTGTGGCTTTTGGGTTTGGCTGTGAGTACTTTGCCCTTTTTGAGGAGCAG 


2340 




Illllllllllllllllllllll II lllllllllll II llllllll llllllll 




Db 


2662 


CTGCTGTCTCCTGTGGCTTTTGGA TTCGGCTGTGA GTATTTCGC C CTTTTCGA GGAGC AA 


2721 


Qy 


2341 


GGCATTGGAGTGCAGTGGGACAACCTGTTTGAGAGTCCTGTGGAGGAAGATGGCTTCAAT 


2400 




II II II II II llllllll II llllllll II llllllll II lllllllll 




Db 


2722 


GGTATCGGGGTCC AATGGGA C AATCTCTTTGA GAGCCCGGTGGAGGAGGA CGGCTTC A AT 


2781 


Qy 


2401 


CTCACCACTTCGGTCTCCATGATGCTGTTTGACACCTTCCTCTATGGGGTGATGACCTGG 


2460 




lllllllll 1 II lllllllllll lllllllllll llllllll llllllll III 




Db 


2782 


CTC A CC A CTGC A GTGTC C ATGATGCTCTTTGA C A CCTTTCTCTATGGCGTGATGA C ATGG 


O ft A1 


Qy 


2461 


TACATTGAGGCTGTCTTTCCAGGCCAGTACGGAATTCCCAGGCCCTGGTATTTTCCTTGC 


2520 




IIIII II II lllllllllll IIIII lllllllllllllllllllllllllllll 




Db 


2842 


TA CATCGAAGC CGTCTTTC C AGGA C A GTATGGAATTCC C AGGC C CTGGTATTTTC CTTGT 


2 901 


Qy 


2521 


ACCAAGTCCTACTGGTTTGGCGAGGAAAGTGATGAGAAGAGCCACCCTGGTTCCAACCAG 


2580 




llllllll lllllllllll IIIIIII llllllllllllllllllllllllll nil 




Db 


2902 


ACCAAGTCATACTGGTTTGGTGAGGAAATTGATGAGAAGAGCCACCCTGGTTCCAGCCAG 


2961 


Qy 


2581 


AAGAGAATATCAGAAATCTGCATGGAGGAGGAACCCACCCACTTGAAGCTGGGCGTGTCC 


2640 




III II 1 IIIIIIIIIIIIIIIII lllllllllll II III mill mill 




Db 


2962 


AAGGGAGTGTCAGAAATCTGCATGGAAGAGGAACCCACTCATCTGAGGCTGGGGGTGTCC 


3021 


Qy 


2641 


ATTCAGAACCTGGTAAAAGTCTACCGAGATGGGATGAAGGTGGCTGTCGATGGCCTGGCA 


2700 




IIIIIIIIIIIIII II II iiiiimiii llllllll mil mm im 




Db 


3022 


ATTCAGAACCTGGTGAAGGTTTACCGAGATGGCATGAAGGTTGCTGTGGATGGCTTGGCG 


3081 


Qy 


2701 


CTGAATTTTTATGAGGGCCAGATCACCTCCTTCCTGGGCCACAATGGAGCGGGGAAGACG 


2760 




II II Mill II llllllll llllllllllllllllllllllllll llllllll 




Db 


3082 


CTCAACTTTTA CGAAGGCC AGATTACCTCCTTCCTGGGCCA C A ATGGAGCAGGGAAGA CC 


3141 


Qy 


2761 


ACCACCATGTCAATCCTGACCGGGTTGTTCCCCCCGACCTCGGGCACCGCCTACATCCTG 


2820 




IIIIIIIIIIIIII IIIII III Mil IIIII II II Mill llllllllllll 




Db 


3142 


ACCACCATGTCAATACTGACTGGGCTGTTTCCCCCAACTTCTGGCACGGCCTACATCCTG 


32 01 


Qy 


2821 


GGAAAAGACATTCGCTCTGAGATGAGCACCATCCGGCAGAACCTGGGGGTCTGTCCCCAG 


2880 




II II lllllllllll iiiiiim lllllllllllllllllll miiiiiim 




Db 


3202 


GGGAAGGACATTCGCTCGGAGATGAGCTCCATCCGGCAGAACCTGGGAGTCTGTCCCCAG 


3261 


Qy 


2881 


CATAA CGTGCTGTTTGA CATGCTGA CTGTCGAAGAA CACATCTGGTTCTATGCCCGCTTG 


2940 




IIIII iiiiiiiiiiiiiiiiiiiiiiiiiim IIIIIIIIIIIIIIIII III 1 




Db 


3262 


CATAATGTGCTGTTTGACATGCTGACTGTCGAAGAGCACATCTGGTTCTATGCGCGCCTA 


3321 


Qy 


2941 


AAAGGGCTCTCTGAGAAGCACGTGAAGGCGGAGATGGAGCAGATGGCCCTGGATGTTGGT 


3000 




II llllllll IIIIIIIIIIMII II IIIIIIIIIIIIIIIIIIIIIMIIIIII 




Db 


3322 


AAGGGGCTCTCAGAGAAGCACGTGAAAGCAGAGATGGAGCAGATGGCCCTGGATGTTGGC 


3381 


Qy 


3001 


TTGCCATCAAGCAAGCTGAAAAGCAAAACAAGCCAGCTGTCAGGTGGAATGCAGAGAAAG 


3060 




II II 1 IIIIIIIIIIIIIIIMIII II IIIII llllllll llllllllllll 




Db 


3382 


TTACCCCCGAGCAAGCTGAAAAGCAAAACGAGTCAGCTCTCAGGTGGGATGCAGAGAAAG 


3441 


Qy 


3061 


CTATCTGTGGCCTTGGCCTTTGTCGGGGGATCTAAGGTTGTCATTCTGGATGAACCCACA 


3120 


Db 


3442 


II iiiiiiiiiimiiii II II mil iiiiiiimiiiim ii mm 

CTGTCTGTGGCCTTGGCCTTCGTGGGTGGATCCAAGGTTGTCATTCTGGACGAGCCCACA 


3501 


Qy 


3121 


GCTGGTGTGGA CCCTTA CTCCCGCAGGGGAATATGGGAGCTGCTGCTGAAATACCGACA A 


3180 




II II IIIMIII IIIII IIIIIIIIIIIIIIIII II IIIII llllllll III 




Db 


3502 


GCCGGGGTGGA CCCGTACTCTCGCAGGGGAATATGGGAACTCCTGCTAAAATA CCGGCAA 


3561 


Qy 


3181 


GGCCGCA CCATTATTCTCTCTA C A CACCACATGGATGAAGCGGACGTCCTGGGGGACAGG 


3240 



lllllllllllllll I IIIIIIIIIIIIIIIII IIIII III MM llllllll 



Db 


3562 


GGCCGCACCATTATTTTGTCTACACACCACATGGACGAAGCTGACATCCrrGGGGACAGA 


3621 


Qy 


3241 


ATTGCCATCATCTCCCATGGGAAGCTGTGCTGTGTGGGCTCCTCCCTGTTTCTGAAGAAC 


3300 




lllllllllll lllllllllllllllll lllllllllllllllllllll INI III 




Db 


3622 


ATTGCCATCATTTCCCATGGGAAGCTGTGTTGTGTGGGCTCCTCCCTGriTri'GAAAAAC 


3681 


Qy 


3301 


CAGCTGGGAACAGGCTACTACCTGACCTTGGTCAAGAAAGATGTGGAATCCTCCCTCAGT 


3360 




III lllllll II Mill IIIIM Mil lllllllllllllllll lllllllll 




Db 


3682 


CAGTTGGGAACGGGTTACTATCTGACCCTGGTTAAGAAAGATGTGGAATCGTCCCTCAGT 


3741 


Qy 


3361 


TCCTGCAGAAACAGTAGTAGCACTGTGTCATACCTGAAAAAGGAGGACAGTGTTTCTCAG 


3420 




lllllllllllllllll mil mil 1 iiiiiiiiiiiiiiiiiiiiiiiiiii 




Db 


3742 


TCCTGCAGAAACAGTAGCAGCACCGTGTCTTGTCTGAAAAAGGAGGACAGTGTTTCTCAG 


3801 


Qy 


3421 


AGCAGTTCTGATGCTGGCCTGGGCAGCGACCATGAGAGTGACACGCTGACCATCGATGTC 


3480 




IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII IIIIIIIIIIIIIIIIMIIIIII 




Db 


3802 


AGCAGTTCTGATGCTGGCCTGGGCAGCGACCATGAAAGTGACACGCTGACCATCGATGTC 


3861 


Qy 


3481 


TCTGCTATCTCCAACCTCATCAGGAAGCATGTGTCTGAAGCCCGGCTGGTGGAAGACATA 


3540 




imiiiiiiiiiiiiiiiiiiiiiiiii iiimiiiiiiiimiiiiii iim 




Db 


3862 


TCTGCTATCTCCAACCTCATCAGGAAGCACGTGTCTGAAGCCCGGCTGGTGGAGGACATT 


3921 


Qy 


3541 


GGGCATGAGCTGACCTATGTGCTGCCATATGAAGCTGCTAAGGAGGGAGCCTTTGTGGAA 


3600 




Mill IIIIIIIIIIIIIIMIIM II Mill II IIIIIIIIIMIIIIIIIIII 




Db 


3922 


GGGCACGAGCTGACCTATGTGCTGCCGTACGAAGCCGCGAAGGAGGGAGCCTTTGTGGAA 


3981 


Qy 


3601 


CTCTTTCATGAGATTGATGACCGGCTCTCAGACCTGGGCATTTCTAGTTATGGCATCTCA 


3660 




Mill lllllllllllllllllllllllllllllllllll II IMIMIIIIIIII 




Db 


3982 


CTCTTCCATGAGATTGATGACCGGCTCTCAGACCTGGGCATCTCCAGTTATGGCATCTCG 


4041 


Qy 


3661 


GAGACGACCCTGGAAGAAATATTCCTCAAGGTGGCCGAAGAGAGTGGGGTGGATGCTGAG 


3720 




Mill MIMIIIIIIIIIIIIIIIIM Mill IMIIIII lllllllllllllll 




Db 


4042 


GAGACCACCCTGGAAGAAATATTCCTCAAAGTGGCTGAAGAGAGCGGGGTGGATGCTGAG 


4101 


Qy 


3721 


ACCTCAGATGGTACCTTGCCAGCAAGACGAAACAGGCGGGCCTTCGGGGACAAGCAGAGC 


3780 




IIIIIIIMIIIII IIIIIIIIIIIIIMIIIII IIIIIIIIIIIIIIIIIMIIIII 




Db 


4102 


ACCTCAGATGGTA CTTTGCCAGCAAGA CGA AA CAGACGGGCCTTCGGGGACAAGCAGAGC 


4161 


Qy 


3781 


TGTCTTCGCCCGTTCACTGAAGATGATGCTGCTGATCCAAATGATTCTGACATAGACCCA 


3840 




Mill 1 III II II lllllllllllll IIIIM Mill lllllllllllllll 




Db 


4162 


TGTCTGCACCCATTTA CGGAAGATGATGCTGTTGATCCC A ATGA CTCTGACATAGACCCA 


4221 


Qy 


3841 


GAATCCAGAGAGACAGACTTGCTCAGTGGGATGGATGGCAAAGGGTCCTACCAGGTGAAA 


3900 




IIIIIIM Mill III llllllllllllllll llllllll Mlllllll MM 




Db 


4222 


GAATCCAGGGAGACCGACCTGCTCAGTGGGATGGACGGCAAAGGCTCCTACCAGCTGAAG 


4281 


Qy 


3901 


GGCTGGAAACTTACACAGCAACAGTTTGTGGCCCTTTTGTGGAAGAGACTGCTAATTGCC 


3960 




lllllllllll M IIIMIIIIMIIIMIIMIIIIIIIIIIII Mill IIIIM 




Db 


4282 


GGCTGGAAACTCACCCAGCAACAGTTTGTGGCCCTTTTGTGGAAGAGGCTGCTGATTGCC 


4341 


Qy 


3961 


AGACGGAGTCGGAAAGGATTTTTTGCTCAGATTGTCTTGCCAGCTGTGTTTGTCTGCATT 


4020 




llllllll Mill II M MMIMIIIIIMI llllllllll Mill IIIIM 




Db 


4342 


AGACGGAGCCGGAAGGGTTTCTTTGCTCAGATTGTCCTGCCAGCTGTCTTTGTTTGCATT 


4401 


Qy 


4021 


GCCCTTGTGTTCAGCCTGATCGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 


4080 






Mill M IIIMIIIIII IIIIIIIIIIIIIIIIIMIMIIIIMIIIIIIIIIII 




Db 


4402 


GCCCTGGTCTTCAGCCTGATTGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 


4461 


Qy 


4081 


CCCTGGATGTACAACGAACAGTACACATTTGTCAGCAATGATGCTCCTGAGGACACGGGA 


4140 




lllllllllll II II Mill IIIIMIIIM IIIIIMIIM IIIIMI III 




Db 


4462 


CCCTGGATGTATAATGAGCAGTATACATTTGTCAGTAATGATGCTCCCGAGGACATGGGC 


4521 


Qy 


4141 


ACCCTGGAACTCTTAAACGCCCTCACCAAAGACCCTGGCTTCGGGACCCGCTGTATGGAA 


4200 




MM lllllll 1 II II M IIMIIII II Mill IIIIMIIIIIIIIIIII 




Db 


4522 


ACCCAGGAA CTCCTGAATGCTCTGA CCAAAGATCC AGGCTTTGGGACCCGCTGTATGGAA 


4581 


Qy 


4201 


GGAAACCCAATCCCAGACACGCCCTGCCAGGCAGGGGAGGAAGAGTGGACCACTGCCCCA 


4260 






IIIIMIIIIIIilMI II M III III Mllllll M lllllll III 




Db 


4582 


GGAAACCCAATCCCAGATACCCCTTGCTTGGCTGGGGAGGAGGACTGGACCATCAGCCCC 


4641 


Qy 


4261 


GTTCCCCAGACCATCATGGACCTCTTCCAGAATGGGAACTGGACAATGCAGAACCCTTCA 


4320 



II lUIIII Mil lllllllllllllilllll Mllllll III lllilll ill 

Db 4642 GTCCCCCAGAGCATCGTGGACCTCTTCCAGAATGGAAACTGGACCATGAAGAACCCCTCA 47 01 



Qy 4 321 CCTGCATGCCAGTGTAGCAGCGACAAAATCAAGAAGATGCTGCCTGTGTGTCCCCCAGGG 4380 

Mill llllllllllllll lllllllllllllllllllllllllllllllllllllll 

Db 4702 CCTGCGTGCCAGTGTAGCAGTGACAAAATCAAGAAGATGCTGCCTGTGTGTCCCCCAGGG 4761 

Qy 4 381 GCAGGGGGGCTGCCTCCTCCAC 4402 

llllllllllllll Mill I 
Db 4762 GCAGGGGGGCTGCCACCTCCTC 4783 



SEQ ID NO: 2 



Result Query 



No, 


Score 


Match Length 


DB 


ID 


Description 


1 


11469 


100 . 


. 0 


2201 


21 


AAy79380 


Human ATP binding 


2 


11469 


100 , 


. 0 


2201 


22 


AAE13021 


Human ATP binding 


3 


11469 


100 . 


. 0 


2201 


22 


AAM50227 


Human ATP binding 


4 


11469 


100 . 


. 0 


2201 


23 


ABP65164 


Hypoxia- regulated 


5 


11469 


100 . 


, 0 


2261 


22 


AAE13022 


Human ATP binding 


6 


11469 


100 . 


. 0 


2261 


22 


AAM5 0228 


Human ATP binding 


7 


11469 


100 . 


. 0 


2261 


22 


AAU02176 


Human ABCl . Homo 


8 


11469 


100 . 


. 0 


2261 


22 


AAU02177 


Human ABCl mutant 


9 


11469 


100 . 


. 0 


2261 


23 


ABB83111 


Human ABCAl transp 


10 


11469 


100 . 


. 0 


2261 


23 


ABP52092 


Homo sapiens ABC t 


11 


11469 


100 . 


. 0 


2261 


23 


AAE23000 


Human ABCl full-le 


12 


11468 


100. 


.0 


2261 


23 


ABB83117 


Polymorphic human 


13 


11468 


100. 


.0 


2261 


23 


ABB83124 


Polymorphic human 


14 


11466 


100 . 


. 0 


2261 


22 


AAU02183 


Human ABCl mutant 


15 


11466 


100. 


.0 


2261 


22 


AAU02188 


Human ABCl mutant 


16 


11466 


100. 


.0 


2261 


23 


ABB83115 


Polymorphic human 


17 


11466 


100. 


.0 


2261 


23 


ABB83116 


Polymorphic human 


18 


11466 


100. 


.0 


2261 


23 


ABB83119 


Polymorphic human 


19 


11466 


100. 


.0 


2261 


23 


ABB83122 


Polymorphic humsin 


20 


11466 


100. 


.0 


2261 


23 


ABB83123 


Polymorphic human 


21 


11465 


100. 


.0 


2261 


22 


AAU0218 9 


Human ABCl mutant 


22 


11464 


100. 


.0 


2261 


23 


ABB83121 


Polymorphic human 


23 


11463 


99. 


.9 


2261 


22 


AAU02181 


Human ABCl mutant 


24 


11462 


99. 


.9 


2261 


22 


AAM78550 


Human protein SEQ 


25 


11462 


99. 


.9 


2263 


22 


ABB11956 


Human ABCAl homolo 


26 


11462 


99. 


.9 


2263 


22 


AAM79534 


Human protein SEQ 


27 


11461 


99. 


.9 


2261 


22 


AAU02182 


Human ABCl mutant 


28 


11461 


99 , 


.9 


2261 


22 


AAU02186 


Human ABCl mutant 


29 


11459 


99 , 


. 9 


2261 


23 


ABB83118 


Polymorphic human 


30 


11458 


99 , 


. 9 


2261 


23 


ABB83120 


Polymorphic human 


31 


11440 


99 , 


.7 


2261 


21 


AAB38082 


Human ABCl cholest 


32 


11440 


99, 


.7 


2261 


22 


AAB71749 


Human ABCl protein 


33 


11440 


99, 


.7 


2261 


24 


ABU118 99 


Human ATP -binding 


34 


11439 


99 


.7 


2261 


23 


ABB81578 


Human ABC-A-1-1 pr 


35 


11437 


99, 


.7 


2261 


21 


AAB38109 


Human ABCl cholest 


36 


11437 


99, 


.7 


2261 


21 


AAB38111 


Human ABCl cholest 


37 


11437 


99, 


.1 


2261 


21 


AAB38114 


Human ABCl cholest 


38 


11437 


99, 


.7 


2261 


21 


AAB38115 


Human ABCl cholest 


39 


11437 


99, 


.7 


2261 


21 


AAB38117 


Human ABCl cholest 


No. 


Score 


Match 


Length 


CB 


ID 


Description 


1 


6909 


60 


.2 


1375 


3 


US-08-665-259-26 


Sequence 26, Appl 


2 


6909 


60 


.2 


1375 


3 


US-08-762-500-26 


Sequence 26, Appl 


3 


3129.5 


27 


.3 


1457 


3 


US-08-665-259-27 


Sequence 27, Appl 


4 


3129.5 


27 


.3 


1457 


3 


US-08-762-500-27 


Sequence 27, Appl 


5 


2635 .5 


23 


.0 


1684 


3 


US-08-665-259-25 


Sequence 25, Appl 


6 


2635 .5 


23 


.0 


1684 


3 


US-08-762-500-25 


Sequence 25, Appl 


7 


2635 .5 


23 


.0 


1704 


3 


US-08-762-500-75 


Sequence 75, Appl 



8 


359 


3 


1 


315 


4 


US-09-328 


-352-4388 


Sequence 


4388, Ap 


9 


354 


3 


.1 


332 


4 


US-09-107 


-532A-3752 


Sequence 


3752, Ap 


10 


352.5 


3 


.1 


335 


4 


US-09-252 


-991A-20837 


Sequence 


20837, A 


11 


352 


3 


.1 


322 


4 


US-09-107 


-532A-4662 


Sequence 


4662, Ap 


12 


346 


3 


.0 


589 


4 


US-09-328 


-352-7592 


Sequence 


7592, Ap 


13 


336.5 


2 


.9 


1280 


2 


US-08-583 


-276-19 


Sequence 


19, Appl 


14 


335.5 


2 


.9 


291 


4 


US-09-107 


-532A-4205 


Sequence 


4205, Ap 


15 


335.5 


2 


.9 


1280 


4 


US-09-767 


-594-2 


Sequence 


2 , Appli 


16 


335.5 


2 


.9 


1280 


6 


5206352-4 




Patent No 


5206352 


17 


333 


2 


. 9 


1279 


2 


US-08-784 


-649A-2 


Sequence 


2 , Appli 


18 


332 


2 


. 9 


402 


4 


US-09-107 


-532A-5360 


Sequence 


5360, Ap 


19 


330.5 


2 


.9 


1280 


2 


US-08-752 


-447-2 


Sequence 


2 , Appli 



RESULT 1 

US-08-665-259-26 

Sequence 26, Application US/08665259 
Patent No. 6028173 
GENERAL INFORMATION: 

APPLICANT: Landes, Gregory M. 
APPLICANT: Bum, Ticaothy C. 
APPLICANT: Connors, Timothy D. 
APPLICANT: Dackowski, William R. 
APPLICANT: Van Raay, Terence J. 
APPLICANT: Klinger, Katherine W. 

TITLE OF INVENTION: NOVEL HUMAN CHROMOSOME 16 GENES, 

TITLE OF INVENTION: COMPOSITIONS, METHODS OF MAKING AND USING SAME 
NUMBER OF SEQUENCES: 73 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: GENZYME CORPORATION 
STREET: One Mountain Road 
CITY: Frarainghara 
STATE : Massachusetts 
COUNTRY: United States of America 
ZIP: 01701 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC -DOS/MS -DOS 
SOFTWARE: Patent In Release #1.0, Version #1.3 0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/665 , 259 
FILING DATE: 17-JUN-1996 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: Dugan, Deborah A. 
REGISTRATION NUMBER: 37,315 
REFERENCE/DOCKET NUMBER: IG5-9.1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (508) 872-8400 
TELEFAX: (508) 872-5415 
INFORMATION FOR SEQ ID NO: 26: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1375 amino acids 
TYPE : amino acid 
STRANDEDNESS : not relevant 
TOPOLOGY : unknown 
MOLECULE TYPE: protein 
US-08-665-259-26 

Query Match 6 0.2%; Score 69 09; DB 3; Length 1375; 

Best Local Similarity 96.9%; Pred. No. 0; 

Matches 1332; Conservative 21; Mismatches 22; Indels 0; Gaps 0; 
Qy 82 7 CMEEEPTHLKLGVS I QNLVKVYRDGMKVAVDGLALNFYEGQ ITS FLGHNGAGKTTTMS I L 886 

MllllllhlllllllllllllllllllllllMIIIIIIIIIIIIIIIIIIIIMIII 

Db 1 CMEEE PTHLRLGVS I QNLVKVYRDGMKVAVDGLALNFYEGQ ITS FLGHNGAGKTTTMS I L 6 0 



Qy 



887 TGLFPPTSGTAYILGKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEK 946 

IIIIIIIIIIIIIMIIIIIIIhllllllllllllllllllllllllllllMIIIIII 



Db 


61 


TGLFPPTSGTA Y ILGKDIRSEMSS IRQNLGVCPQHNVLFDMLTVEEHI WFYARLKGLSEK 


12 0 


Qy 


947 


HVKAEMEQMALDVGLPS S KLKSKTSQLSGGMQRKLSVALA FVGGS KW I LDE PTAGVDP Y 


1006 




MIIIIMIIIIMII IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIII 




Db 


121 


HVKAEMEQMALDVGLPPSKLKS KTSQLSGGMQRKLSVALA FVGGS KWI LDE PTAGVDP Y 


180 


Qy 


1007 


SRRGIWELLLKYRQGRTIILSTHHMDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGY 


1066 




IIIMIIIIIIIIIIIMIIIIIIIIIIhlllllllllllllllllllMIMIIIMI 




Db 


181 


SRRGIWELLLKYRQGRTI ILSTHHMDEADILGDRI AI ISHGKLCCVGSSLFLKNQLGTGY 


240 


Qy 


1067 


YLTLVKKDVE S SLSS CRNS SSTVS YLKKEDS VSQSS SDAGLGSDHE SDTLT I DVS A I SNL 


1126 




IIMIIMIillllllllllllll IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 




Db 


241 


YLTLVKKDVESSLSS CRNS SSTVS CLKKEDS VSQS S S DAGLGSDHE SDTLT I DVS A I SNL 


300 


Qy 


1127 


IRKHVSEARLVEDI GHELTYVLPYEAAKEGAFVELFHE IDDRLSDLGI SSYGI SETTLEE 


1186 




IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIMII 




Db 


301 


IRKHVSEARLVEDI GHELTYVLPYEAAKEGAFVELFHE IDDRLSDLGI SSYGI SETTLEE 


360 


Qy 


1187 


I FLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDI DPESRETD 


1246 




llllllllllllllllllllllllllllllllllll lllllll lllllllllllllll 




Db 


361 


I FLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSD I DPESRETD 


420 


Qy 


1247 


LLSGMDGKGSYQVKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSL 


1306 




lllllllllllhlllllllllllllllllllllMIIIIIIIIIIIIIIIMIIIIIII 




Db 


421 


LLSGMDGKGSYQLKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSL 


480 


Qy 


1307 


I VPPFGKYPSLELQPWMYNEQ YTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNP I PD 


1366 




llllllllllllllllllllllllllllllll II Illlllllllllllllllllllll 




Db 


481 


I VP P FGKYPSLELQ PWMYNEQ YTFVSNDA PEDMGTQELLNALTKDPGFGTRCMEGNP I PD 


540 


Qy 


1367 


TPCQAGEEEWTTAPVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPP 


1426 






III Nihil :|MhhllMlllllhllMIIIIIIIIIIMIIIIIIIIIIIII 




Db 


541 


TPCLAGEEDWTISPVPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPP 


600 


Qy 


1427 


PQRKQNTAD ILQDLTGRNI SDYLVKTYVQ 1 1 AKSLKNKI WVNEFRYGGFSLGVSNTQALP 


1486 




Hill lllllhllllllMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIhllll 




Db 


601 


PQRKQKTAD ILQNLTGRNI SDYLVKTYVQ I I AKSLKNKI WVNEFRYGGFSLGVSNSQALP 


660 


Qy 


1487 


PSQEVNDATKQMKKHLKLAKDSSADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHA I SSFL 


1546 






II IMII Hill HI Ihlllllhllllll lllhlllllHHIIIIIHIH 




Db 


661 


PSHEVNDA I KQMKKLLKLTKDTSADRFLSSLGRFMAGLDTKNNVKVWFNNKGWHA I SSFL 


720 


Qy 


1547 


NVINNA ILRANLQKGENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVS I CVI FAMS 


1606 




HHHHHHHIHH 1 1 II II 1 1 1 II II 1 1 1 II II 1 H II 1 1 II II 1 1 II 1 1 1 1 




Db 


721 


NVINNAILRANLQKGENPSQYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMS 


780 


Qy 


1607 


FVPASFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQ 


1666 




lllllll II III! II IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIII III III III 




Db 


781 


FVP AS F WFL I QERVS KA KHLQF I SGVKPVI YWLSNFVWDMCNYWPATLVI 1 1 F I CFQQ 


840 


Qy 


1667 


KS YVSSTNLPVLALLLLLYGWS ITPLMYPAS FVFKI PSTA YWLTSVNLF IGINGSVATF 


1726 




Hill HH II HH II HIIIIIIIIIIIIIHIIHIIHII II HIHIIIIIIIH 




Db 


841 


KS YVSSTNLPVLALLLLLYGWS ITPLMYPAS FVFKI PSTA YWLTSVNLF IGINGSVATF 


900 


Qy 


1727 


VLELFTDNKLNNIND ILKS VFL I FPHFCLGRGL I DMVKNQAMADALERFGENRFVSPLS W 


1786 




1 1 H 1 h II 1 h 1 II II 1 1 II II H 1 H 1 H H 1 1 II H H 1 H II H 1 1 H II 1 1 H 1 1 




Db 


901 


VLELFTNNKLND IND I L KS VFL I FPHFCLGRGL I DMVKNQ AMADALERFGENRFVS PLSW 


960 


Qy 


1787 


DLVGRNLFAMAVEGWFFL ITVL IQ YRFF I RPRPVNAKLSPLNDEDEDVRRERQRI LDGG 


1846 






IIHIIIIIIIIIIIHHIIIIIIIIHIIIIII III IIIIIIIIIIIIIIIIIIH 




Db 


961 


DLVGRNLFAMAVEGWFFL ITVL IQ YRFF IRPRPVKAKLPPLNDEDEDVRRERQRILDGG 


1020 


Qy 


1847 


GQND I LE I KELTKI YRRKRKP AVDR I CVGI P PGE CFGLLGVNQAGKSSTFKMLTGDTTVT 


1906 






llllllllllllllllllllllllllhllllllllllllllllllhlllllllll II 




Db 


1021 


GQND I LE I KELTK I YRRKRKPA VDR I C I GI P PGE CFGLLGVNGAGKSTTFKMLTGDTPVT 


1080 


Qy 


1907 


RGDAFLNRNS I LSNI HE VHQNMGYCPQFDA ITELLTGREHVE FFALLRGVPE KE VGKVGE 


1966 






HHIIhlHIIHIIIHIHIIIHIIHIHIHIHIIHIIHIHIHII II 




Db 


1081 


RGDAFLNKNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKFGE 


1140 


Qy 


1967 


WAIRKLGLVKYGEKYAGNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNC 


2026 



Illlllllllllllll illlllllllllilllllllllllllllllillllllllllll 

Db 1141 WAIRKLGLVKYGEKYASNYSGGNKRKLSTAMALIGGPPVVFLDEPTTCaiDPKARRFLWNC 1200 

Qy 2027 ALSVVKEGRSVVLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIVVRIA 2 086 

llhMlillllllllllllllllllllilillllllilMIIIIIIIIIIIIIIIIIII 

Db 1201 ALSIVKEGRSVVLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIVVRIA 1260 

Qy 2087 GSNPDLKPVQDFFGLAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDY 2146 

lllllllllhllllllllll lllllllllllllllllllllllillllllllllllll 
Db 1261 GSNPDLKPVQEFFGLAFPGSVLKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDY 1320 

Qy 2147 SVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTVVDVAVLTSFLQDEKVKESYV 22 01 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIII 

Db 1321 SVSQTTLDQVFVNFABODQSDDDHLKDLSLHKNQTVVDVAVLTSFLQDEKVKESYV 137 5 



RESULT 2 

US-08-762-500-26 

Sequence 26, Application US/08762500 
Patent No. 6030806 
GENERAL INFORMATION: 

APPLICANT: Landes, Gregory M. 
APPLICANT: Bum, Timothy C. 
APPLICANT: Connors, Timothy D, 
APPLICANT: Dackowski, William R. 
APPLICANT: Van Raay, Terence J. 
APPLICANT: Klinger, Katherine W. 

TITLE OF INVENTION: NOVEL HUMAN CHROMOSOME 16 GENES, 

TITLE OF INVENTION: COMPOSITIONS, METHODS OF MAKING AND USING SAME 
NUMBER OF SEQUENCES: 83 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: GENZYME CORPORATION 
STREET: One Mountain Road 
CITY: Framingham 
STATE : Massachusetts 
COUNTRY: United States of America 
ZIP: 01701 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC -DOS/MS -DOS 
SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/762 , 500 
FILING DATE: 09-DEC-1996 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/665,259 
FILING DATE: 17-JUN-1996 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US96/10469 
FILING DATE: 17-JUN-1996 
ATTORNEY /AGENT INFORMATION: 
NAME: Dugan, Deborah A. 
REGISTRATION NUMBER: 37,315 
REFERENCE /DOCKET NUMBER: IG5-9.3 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (508) 872-8400 
TELEFAX: (508) 872-5415 
INFORMATION FOR SEQ ID NO : 26: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 137 5 amino acids 
TYPE : amino acid 
STRANDEDNESS : not relevant 
TOPOLOGY : unknown 
MOLECULE TYPE: protein 
US-08-762-500-26 



Query Match 



60.2%; Score 6909; DB 3; Length 1375; 



Best Local Similarity 96.9%; Pred. No. 0; 

Matches 1332; Conservative 21; Mismatches 22; Indels 0; Gaps 



Qy 


827 


CMEEEPTHLKLGVSIQNLVKVYRDGMKVAVIJ^ ii>rijvyriNLjA(jJ\.i i ino j.±j 


886 




llllllllhlllllllllllllllillllllllllllllllllllilllllllllllll 




Db 


1 


CMEEEPTHLRLGVSIQOTjVKVYRDGMKVAVDGLALNFYBGQITSFIiGHNGAGK'riTMSIL 


60 


Qy 


887 


TGLFPPTSGTAYILGKDIRSEMSTIRQNLGVCPQHNVLFDMliTV^EnlWFi^ 






llllllllllllllllllllllhllllllllllllllllllllllllllllllllllll 




Db 


61 


TGLFPPTSGTAYILGKDIRSEMSSIRQNLGVCPQHNVLFDMLTVEBHIWFYARLKGLSEK 


120 


Qy 


947 


HVKAEMEQMALDVGLPSSKLKSKTSQLSGGMQRKLSVALAFVGGSK 


1 nnfi 
xviuo 




llllllllllllllll lllllllllllllllllllllllllllllllllllllllllll 




Db 


121 


HVKAEMEQMALDVGLPPSKLKSKTSQLSGCTIQRKLSVALAFVGGSKWILDEPTAGVDPY 


180 


Qy 


1007 


SRRGIWELLLBTiRQGRTIILSTHHMDEADVLGDRIAI ISnGlUjCCV(jbbljrl<i\rJyJj(jlAji 


X UO D 




llllllllllllllllllllllllllllhllllllllllllllllllllllllllllil 




Db 


181 


SRRGIWELLLKYRQGRTIILSTHHMDEADILGDRIAIISHGKLCCVGSSLFLKNQLGTGY 


240 


Qy 


1067 


YLTLVKKDVESSLSSCRNSSSTVS I JjKKbU lUVoAXorjij 


112 6 




IIIIIIIIIIMIIIIIIIIIIII lllllllllllllllllllllllllllllllllll 




Db 


241 


YLTLVKKDVESSLSSCRNSSSTVSCLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNL 


300 


Qy 


1127 


IRKHVSEARIiVEDIGHELTyVijFYEAAKEGAr VbJjrnliiU HjI&Ci i iijriEi 


1 1 ftfi 
xxo o 




IIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIII 




Db 


301 


IRKHVSEARLVEDIGHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEE 


360 


Qy 


1187 


IFLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETD 


1246 




llllllllllllllllllllllllllllllllllll IIIMII lllllllllllllll 




Db 


361 


IFLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETD 


420 


Qy 


1247 


LLSGMDGKGS YQVKGWKLTQQQ FVALL WKRLL I ARRSRKGF FAQ I VL P A VFVC I ALVFS L 


1306 




IIMIIIIIIIhlllllllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 




Db 


421 


LLSGMDGKGSYQLKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSL 


480 


Qy 


1307 


I VPPFGKYPSLELQPWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNP I PD 


136 6 




IIMIIIIIIIIIIIIIIIIIIIIIIIIIIII II IIIMIIIIIIIIIIIIIIIIIII 




Db 


481 


IVPPFGKYPSLELQPWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPD 


540 


Qy 


1367 


TPCQAGEEEWTTAPVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPP 


142 6 






III nihil nilhlHIIIIIIIIhllllllllllMIIIIIIIIIIIIIIIII 




Db 


541 


TPCLAGEEDWTISPVPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPP 


600 


Qy 


1427 


PQRKQNTAD I LQDLTGRNI SDYLVKTYVQ I I AKSLKNKI WVNEFRYGGFSLGVSNTQALP 


148 6 




Mill lllllhllMlllllllllllllllllllllllllllllllllllllhllll 




Db 


601 


PQRKQKTAD ILQNLTGRNI SDYLVKTYVQ I I AKSLKNKI WVNEFRYGGFSLGVSNSQALP 


660 


Qy 


1487 


PSQE VNDATKQMKKHLKLA KDS S ADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHA I S S FL 


1546 






II Mill Mill III IhllMlhllllll lllhlllllllllllllllllM 




Db 


661 


PSHEVNDAIKQMKKLLKLTKDTSADRFLSSLGRFMAGLDTKNNVKVWFNNKGWHAISSFL 


720 


Qy 


1547 


NVI NNA I LRANLQKGENP SHYGI TA FNHPLNLTKQQLSE VA PMTTS VDVLVS I CV I FAMS 


1606 






1 II II M 1 1 1 1 M M 1 1 1 1 1 II 1 1 1 1 1 II II II 1 II II II II 1 II 1 II 1 1 1 1 1 1 1 1 II 




Db 


721 


NV INNA I LRANLQKGENPS QYGI TAFNHPLNLTKQQLSE VALMTTS VDVLVS I CV I FAMS 


780 


Qy 


1607 


FVPASFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVI I IFICFQQ 


1666 






MIMIIIIIIMIIIIMIIMIIIIIIIMIIIIIIIIIIIIIIMIIIIIMIIIII 




Db 


781 


FVPASFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVI IIFICFQQ 


840 


Qy 


1667 


KSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATF 


1726 




lllllllllllllll II II MM lllllllllllllll Mil II nil MM MM II II 




Db 


841 


KSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATF 


900 


Qy 


1727 


VLELFTDNKLNNINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSW 


1786 






lllllhlllhllMIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIinni 




Db 


901 


VLELFTNNKLNDINDI LKSVFL I FPHFCLGRGL IDMVKNQAMADALERFGENRFVS PLS W 


960 



Qy 

Db 



1787 
961 



DLVGRNLFAMAVEGWFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRILDGG 

IMMMMMIMMMMIIIiniMMIMI III II 1 1 1 1 1 1 II II 1 1 1 1 II 1 1 

DLVGRNLFAMAVEGWFFL I TVL IQYRFF IRPRPVKAKLPPLNDEDEDVRRERQRI LDGG 



1846 
1020 



Qy 


1847 


GQNDILEIKELTKIYRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLTGDTTVT 


1906 




Mllllllllllllllllllllllllhllllllilllllllllllhlllllllli II 




Db 


1021 


GQNDILE I KELTKI YRRKRKPAVDRIC IGI PPGECFGLLGVNGAGKSTTFKMLTGDTPVT 


1080 


Qy 


1907 


RGDAFLNRNSILSNXHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGE 


1966 




llllllhlllllllllllllllllllllllllllllllllllllllllllllllll II 




Db 


1081 


RGDAFLNKNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKFGE 


1140 


Qy 


1967 


WA IRKLGLVKYGEKYAGNYSGGNKRKLSTAMAL IGGPPVVFLDEPTTGMDPKARRFLWNC 


2026 




IIIIIMIIIIIIIII lllllllllllllllllllllllllllllllllllllllllll 




Db 


1141 


WA I RKLGLVKYGE KYASNYSGGNKRKLSTAMAL I GGPP WFLDE PTTC^IDP KARRFL WNC 


1200 


Qy 


2027 


ALS WKEGRS WLTSHSMEECE ALCTRMA I MVNGRFRCLGS VQHLKNRFGDGYTI WRI A 


2086 




llhlllllllllllllllllllllllllllllllllMIIIIIIIIIIIIIIIIIIIII 




Db 


1201 


ALS IVKEGRSWLTSHSMEECEALCTRMA IMVNGRFRCLGSVQHLKNRFGDGYTIWRIA 


1260 


Qy 


2087 


GSNPDLKPVQDF FGIjAF PGSVPKEKxlRNMLy lyijlro 


2146 




lllllllllhllllllllll llllllllllllllllllllllllllllllllllllll 




Db 


1261 


GSNPDLKPVQEFFGLAFPGSVLKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDY 


1320 


Qy 


2147 


SVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 22 01 






lllllllllllilllllllllllllllllllllllllllllllllllllllllll 




Db 


1321 


SVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTVVDVAVLTSFLQDEKVKESYV 1375 








SUMMARIES 

% 





Result Query 

No. Score Match Length DB ID Description 



1 


10906 


95, 


.1 


2201 


2 


A54774 


ATP binding casset 


2 


3338 .5 


29, 


.1 


1529 


2 


A59189 


ATP-binding casset 


3 


3129.5 


27, 


.3 


1472 


2 


B54774 


ATP binding casset 


4 


2638.5 


23, 


.0 


1704 


2 


S71363 


probable ATP-bindi 


5 


2635.5 


23, 


.0 


1704 


2 


A59188 


ATP-binding casset 


6 


2061 


18 , 


.0 


1802 


2 


T33783 


hypothetical prote 


7 


1975 


17 , 


.2 


1816 


2 


A84845 


probable ABC trans 


8 


1854.5 


16 , 


.2 


1447 


2 


T15200 


hypothetical prote 


9 


1792 


15 


.6 


1317 


2 


C88925 


protein F33E11.4 [ 


10 


1535 .5 


13 


.4 


1758 


2 


F88559 


protein C48B4.4b [ 


11 


1528.5 


13 


.3 


1704 


2 


T42749 


ATP-binding casset 


12 


1526 


13 , 


.3 


1767 


2 


S60124 


transport protein 


13 


1393 


12 


.1 


1246 


2 


T00826 


hypothetical prote 


14 


1150.5 


10, 


.0 


1564 


2 


T27121 


hypothetical prote 


15 


1006.5 


8 


.8 


373 


2 


T47150 


hypothetical prote 


16 


1005 .5 


8 


.8 


1431 


2 


T2274S 


hypothetical prote 


17 


846 


7 


.4 


269 


2 


T46467 


hypothetical prote 


18 


786.5 


6 


.9 


1011 


2 


T07712 


probable ABC-type 


19 


695 


6 


.1 


900 


2 


T07717 


probable ABC -type 



RESULT 1 
A54774 

ATP binding cassette trsinsporter ABCl - mouse 
C;Specie3: Mus musculus (house mouse) 

C;Date: 05-Apr-1995 #sequence_revision 05-Apr-1995 #text_change 02-Feb-2 001 
C; Accession: A54774 

R;Luciani, M.F.; Denizot, F.; Savary, S.; Mattel, M.G.; Chimini, G. 
Genomics 21, 150-159, 1994 

A;Title: Cloning of two novel ABC transporters mapping on human chromosome 9. 
A;Reference number: A54774; MUID : 94375 008 ; PMID:80ae7e2 
A;Accession: A54774 
A /Molecule type: mRNA 
A;Residues: 1-2201 <LUC> 

A; Cross-references: GB:X75926; NID:g495256; PIDN : CAA53530 . 1 ; PID:g495257 

C; Super family : unassigned ATP-binding cassette proteins; ATP-binding cassette homology 

C;Keywords: ATP; duplication; nucleotide binding; P-loop 

F; 856-1047/Domain : ATP-binding cassette homology <ABC1> 

F; 873-880/Region: nucleotide-binding motif A (P-loop) 



F;1869-2060/Doiiiain: ATP-binding cassette homology <ABC2> 
F;1886-1893/Region: nucleotide-binding motif A (P-loop) 



Query Match 95.1%; Score 10906; DB 2; Length 22 01; 

Best Local Similarity 94.8%; Pred. No. 0; 

Matches 2087; Conservative 54; Mismatches 60; Indels 0; Gaps 



Qy 


1 


MPSAGTLPWVQGI ICNANNPCFRYPTPGEAPGWGNFNKS IVARLrSDARK^ 


o u 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIhlllllhlllllihIl 




Db 


1 


MPSAGTLPWVQGI ICNANNPCFRYPTPGEAPGWGNFNKS IVSRLFSDAQRLLLYSQRDT 


60 


Qy 


61 


SMKDMRKVLRTLQQIKKSSSNLKLQDFLVDNETFSGFLYHNLSLPRii VUJ^ 


12 0 




hill INI hill HIIIIIIIIIIIMIIIII lllllhllll :h :| 1 




Db 


61 


S I KDMHKVLRMLRQ I KHPNSNLKLQDFLVDNETFSGFLQHNLSLPRSTVDSLLQXNVGLQ 


120 


Qy 


121 


KVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPREKLAAAERVLRSNMDILKPIL 


Ton 




lllllllllll lllllll Ihlllll III lllllhll lllllll IIIIMh = 




Db 


121 


KVFLQGYQLHLASLCNGSKLBEIIQLGDAEVSALCGLPRKKLDAAERVLRYNMDILKPW 


180 


Qy 


181 


RTLNSTSPFPSKEIjAEATKTLLHSLGTLAQELFSMRSWSDPlRQBVMr 


24 0 




mil h: Mill II III lllllll :||IMIIIIIIIIIIIIIIIIIM 




Db 


181 


TKLNSTSHLPTQHLAEATTVLLDSLGGLAQELFSTKSWSDMRQEVMFLTNVNSSSSSTQI 


240 


Qy 


241 


YQ AVS RI VCGHPEGGGL K I KS LNWYE DNNY KAL F6GNGTE E D AE TF YDNS TTP YCNDLM K 


inn 
J u u 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II MM MIIIIMIIIIIIIII 

lllllllllllllllllllllllllllllllllllll llll IlllflllllllVIII 




Db 


241 


YQAVSRIVCGHPEGGGLKIKS LNWYE DNNYKALFGGNNTEEDVDTFYDNSTTPYCNDLMK 


300 


Qy 


301 


NLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 


360 




III II II II II II II MM II II II II II MM II Mill II II II II II MM II II II 




Db 


301 


NLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEC»1WEELS 


360 


Qy 


361 


PKI WTFMENSQEMDLVRMLLDSRDNDHFWEQQLDGLDWTAQD I VAFLA KHPEDVQSSNGS 


420 




hlllllllllllllll Mill II 1 M hi 1 1 1 M M M h 1 1 M h II II II III 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II llll lllllllllll 11)11 ■ 1 1 1 1 1 III 




Db 


361 


PQ I WTFMENS QEMDLVRTLLDSRGNDQ F WE Q KLDGLDWTAQD I MAFLA KNPEDVQ S PNGS 


420 


Qy 


421 


VYTWREAFNETNQAIRTISRFMECVNLNKLEPIATEVWLINKSMELLDERKFWAGIVFTG 


480 




IIIIIIIMIIIMhIIIIIIIIIMIIIIII III II II II M II 1 1 1 M 1 1 1 II II 




Db 


421 


VYTWREAFNETNQAIQTISRFMECVNLNKLEPIPTEVRLINKSMELLDERKFWAGIVFTG 


480 


Qy 


481 


I TPGS lELPHHVKYKI RMDI DNVERTNKI KDGYWDPGPRADPFEDMRYVWGGFA YLQDW 


540 




III hllllllllllllMIIIMIIMIIIIIIIIMIMIIIIIIIIIIIMIIIII 
III i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ■< 1 1 1 1 1 >• I <<■■■ ■ 




Db 


481 


I TPDSVELPHHVKYKI RMD I DNVERTNK I KDGYWDPGPRADPFEDMRYVWGGFA YLQDW 


540 


Qy 


541 


E QA 1 1 RVLTGTE KKTGVYMQQMP YPC YVDD I FLRVMSRSMPLFMTLA W I YS VAVI I KG I V 


600 




1 1 1 1 II 1 1 1 h 1 1 1 1 II h II II II M M II 1 1 M M 1 II M II M II 1 M M 1 M 1 II 

1(11111111 lllllll |||llllllllllllll1IIIIVIIIIII1i>'>>>ll <i 




Db 


541 


EQA II RVLTGSE KKTGVYVQQMPYP C YVDD I FLRVMSRSMPLFMTLA W I YS VAVI I KS IV 


600 


Qy 


601 


YEKEARLKETMRIMGLDNSILWFSWFISSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 


660 




1 II 1 1 II 1 1 1 1 M 1 1 II 1 1 1 1 II 1 h II 1 II 1 1 1 1 1 1 1 1 II II 1 1 M II 1 1 M 1 M 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


YEKEARLKETMRIMGLDNGILWFSWFVSSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 


660 


Qy 


661 


FLSVFAWTILQCFLISTLFSRANLAAACGGI I YFTLYLPYVLCVAWQDYVGFTLKI FAS 


72 0 




M 1 II h 1 1 1 II 1 1 1 1 1 1 1 M II 1 II 1 1 II 1 1 1 II II II 1 M M II 1 1 II II hM 1 1 II 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 I I 1 1 I 1 1 1 1 1 1 1 1 1 




Db 


661 


FLS VFAMVT I LQCFL I STLFSRANLAAA CGG 1 1 YFTL YLP YVLCVA WQDYVGFS I KI F AS 


720 


Qy 


721 


LLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTW 


780 






M II IMIIMIIIIIIIIIIIIIIIIIIIIIII II II llllhllllMIM lllllll 




Db 


721 


LLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTAVSMMLFDTFLYGVMTW 


780 


Qy 


781 


YIEAVFPGQYGIPRPWYFPCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVS 


840 






1 1 1 II 1 1 II 1 1 1 1 II 1 1 1 1 II 1 1 M 1 1 II 1 M M 1 1 h II M M II 1 II 1 M 1 : II M 




Db 


781 


YIEAVFPGQYGIPRPWYFPCTKSYWFGEE IDEKSHPGSSQKGVSE I CMEEEPTHLRLGVS 


840 


Qy 


841 


I QNLVKVYRDGMKVAVDGLALNFYEGQ ITSFLGHNGAGKTTTMS I LTGLFPPTSGTA Y IL 


900 






lllllll IIIIIIIIIIM MM II M llll MUM IIIIIIIIIIIMII II MUM 




Db 


841 


I QNLVKVYRDGMKVAVDGLALNFYEGQ I TS FLGHNGAGKTTTMS I LTGLFPPTSGTA Y I L 


900 


Qy 


901 


GKD I RSEMSTI RQNLGVCPQHNVLFDMLTVEEH I WF YARLKGLSE KHVKAEMEQMALDVG 


960 






II llllllhllllll Mill II llll llll llllll MM lllllllllll MM MM 




Db 


901 


GKDIRSEMSSIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVG 


960 



Qy 


961 


LPSSKLKSKTSQLSGOIQRiajSVALAFVGGSKWILDEPTAGVDPySRRGIWELLLKYRQ 


1020 




II lllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


961 


LPPSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQ 


1020 


Qy 


1021 


GRTIILSTHHMDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 


1080 




llllllllllllllhllllllllilllllllllllllllllllllllllllllllllll 




Db 


1021 


GRTI ILSTHHMDEADILGDRIAI ISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 


1080 


Qy 


1081 


SCRNSSSTVSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDI 


1140 




Illlllllll lllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


1081 


SCRNSSSTVSCLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDI 


1140 


Qy 


1141 


GHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAE 


1200 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIi 




Db 


1141 


GHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAE 


1200 


Qy 


12 01 


TSDGTLPARRIORRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGSYQVK 


1260 




llllllllllllllllllllll lllllll llllllllllllllllllllllllllhl 




Db 


12 01 


TSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETDLLSGMDGKGSYQLK 


1260 


Qy 


1261 


GWKLTQQQFVALL WKRLL I ARRSRKGFFAQ I VLP A VFVC I ALVFSL I VP PFGKYPSLELQ 


1320 




IIIIIIIIIIIIIIIIIIMIIIIIIIIilllllilllillMlllilllllllllllM 




Db 


1261 


GWKLTQQQFVALLWKRLL I ARRSRKGFFAQ I VLPA VFVC I ALVFSL I VPPFGKYPS LELQ 


1320 


Qy 


1321 


PWMY^FEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEG^^PIPDTPCQAGEEEWTTAP 


1380 






llllllllllllllllll II lllllllllllllllllllllllllll Nihil :| 




Db 


1321 


PWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPDTPCLAGEEDWTISP 


1380 


Qy 


1381 


VPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADILQDL 


1440 




MhhlllllMllhlllllllllllllilllllilllillllllllll lllllhl 




Db 


1381 


VPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQKTADILQNL 


1440 


Qy 


1441 


TGRNI SDYLVKTYVQ 1 1 AKSLKNKI WVNE FRYGGFSLGVSNTQALPPSQEVNDATKQMKK 


1500 




IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIhllllll Mill Mill 




Db 


1441 


TGRNI SDYLVKTYVQ II A KSLKNK I WVNE FRYGGFSLGVSNSQALPPS HE VNDA I KQMKK 


1500 


Qy 


1501 


HLKLAKDS S ADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHA I S S FLNV INNA I LRANLQK 


1560 




Ml IhllMlhlMMI lllhlllllMIIIMIIIIIIIIIIIMIMIIMI 




Db 


1501 


LLKLTKDTS ADRFLS SLGRFMAGLDTKNNVKVWFNNKGWHA I S S FLNV INNA I LRANLQ K 


1560 


Qy 


1561 


GENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVS I CVI FAMSFVPASFWFL IQER 


1620 




Mill IIIIIIIIIMIMIIIIIM IIIIIMMIIIIIIMIIIIIIIIMMIII 




Db 


1561 


GENPSQYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMSFVPASFWFLIQER 


1620 


Qy 


1621 


VSKAKHLQF I SGVKPVI YWLSNFVWDMCNYWPATLVI I I F I CFQQKS YVSSTNLPVLAL 


1680 






llllll II lllllll MM MM 1 III II II lllllll MIMIIIIMIIMIIMIII 




Db 


1621 


VSKAKHLQF I SGVKPVI YWLSNFVWDMCNYWPATLVI I IFICFQQKSYVSSTNLPVLAL 


1680 


Qy 


1681 


LLLLYGWS I TPLMYPAS FVFKI PSTA YWLTSVNLF I G I NGS VATFVLELFTDNKLNN I N 


1740 




III Illlllllll MM II II llllll MM Mill MM MM III Mllhlllhll 




Db 


1681 


LLLLYGWS I TPLMYPAS FVF KI PSTA YWLTSVNLF I G INGSVATFVLELFTNNKLND I N 


1740 


Qy 


1741 


D I LKS VFL I FPHFCLGRGL I DMVKNQ AMADALERFGENRFVS PLS WDLVGRNL FAMAVEG 


1800 




llllll M III 1 MM lllllllll MM Mill Mill MM II III MM Mill III 




Db 


1741 


D I LKS VFL I FPHFCLGRGL I DMVKNQ AMADALERFGENRFVS PLS WDLVGRNL FAMAVEG 


1800 


Qy 


1801 


WFFL ITVL IQYRFF I RPRPVNAKLSPLNDEDEDVRRERQRILDGGGQNDILE I KELTKI 


1860 






IIIIIIIIIIIIIIMIIIII III IMIIIIIIIIIIIMMIIIIIIMIIIMIII 




Db 


1801 


WFFL ITVL IQYRFF I RPRPVKAKLPPLNDEDEDVRRERQRILDGGGQNDILE I KELTKI 


1860 


Qy 


1861 


YRRKRKPAVDRI CVG I PPGECFGLLGVNGAGKS STFKMLTGDTTVTRGDA FLNRNS I LSN 


1920 




IMMMMMIhlMMMMMMMMIhlMMMM IIIIIMIhllllll 




Db 


1861 


YRRKRKPAVDRI C I G I PPGECFGLLGVNGAGKSTTF KMLTGDTPVTRGDA FLNK^f S I LSN 


1320 


Qy 


1921 


IHEVHQNMGYCPQFDA ITELLTGREHVEFFALLRGVPEKEVGKVGE WA IRKLGLVKYGEK 


1980 




IIIIIIIIIIIIIIIIIIIIIMIIIMIIMIIIIIIIIIII IMMIIIIIIMMI 




Db 


1921 


I HE VHQNMGYCPQFDA I TELLTGREHVE FFALLRGVPE KEVGKFGE WA I RKLGLVKYGE K 


1980 


Qy 


1981 


YAGNYSGGNKRKLSTAMAL I GGPP WFLDE PTT01DPKARRFLWNCALS WKEGRSWLT 


2040 




M IIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIMIIIMhIIIIIIIIM 




Db 


1981 


YASNYSGGNKRKLSTAMAL I GGPPWFLDE PTTGMDPKARRFLWNCALS I VKEGRSWLT 


2040 



1 



Qy 2041 SHSMBECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIAGSNPDLKPVQDFFG 2100 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHII 

Db 2 041 SHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIAGSNPDLKPVQEFFG 2100 

Qy 2101 LAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 2160 

IIIIMI llllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 2101 LAFPGSVLKEKHRimLQYQLPSSLSSIARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 2160 

Qy 2161 AKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 2 2 01 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIII 

Db 2161 AKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 2 2 01 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


11466 


100 


.0 


2261 


1 


ABC1_HUMAN 


095477 


homo sapien 


2 


10906 


' 95 


.1 


2261 


1 


ABC1_M0USE 


P41233 


mus musculu 


3 


5689.5 


49 


.6 


2273 


1 


ABCR_HUMAN 


P78363 


homo sapien 


4 


4131 


36 


.0 


2436 


1 


ABC2_HUMAN 


Q9b2c7 


homo sapien 


5 


3989.5 


34 


.8 


2434 


1 


ABC2_M0USE 


P41234 


mus m.usculu 


6 


2635.5 


23 


.0 


1704 


1 


ABC3_HUMAN 


Q99758 


homo sapien 


7 


1528.5 


13, 


.3 


1704 


1 


CED7_CAEEL 


P34358 


caenorhabdi 


8 


411 


3 


.6 


330 


1 


DRRA_STRPE 


P32010 


streptomyce 


9 


380.5 


3 


.3 


343 


1 


NODI_RHISN 


P55476 


rhizobium s 


10 


366 


3 


.2 


3 04 


1 


N0DI_RHIS3 


P72335 


rhizobium s 


11 


347 


3 


.0 


308 


1 


YADG_ECOLI 


P36879 


escherichia 


12 


347 


3 


.0 


335 


1 


NDI2_RHIME 


Q8gnh6 


rhizobium m 


13 


344.5 


3 


.0 


340 


1 


NODI_RHILO 


P23703 


rhizobium 1 


14 


343.5 


3 


.0 


347 


1 


NODI_RHIGA 


P50332 


rhizobium g 


15 


335.5 


2 


.9 


1280 


1 


MDR1_HUMAN 


P08183 


homo sapien 


16 


331.5 


2 , 


.9 


355 


1 


NDI1_RHIME 


052618 


rhizobium m 


17 


329.5 


2 , 


.9 


578 


1 


YBHF_ECOLI 


P75776 


escherichia 


18 


327 


2 


.9 


894 


1 


YHIH_ECOLI 


P37624 


escherichia 


19 


327 


2 


.9 


1276 


1 


MDR3 MOUSE 


P21447 


mus musculu 



RESULT 2 
ABC1_M0USE 

ID ABC1_M0USE STANDARD; PRT; 2261 AA , 

AC P41233; 

DT Ol-FEB-1995 (Rel . 31, Created) 

DT 16-OCT-2 001 (Rel. 40, Last sequence update) 

DT 15-SEP-2 003 (Rel. 42, Last annotation update) 

DE ATP-binding cassette, sub- family A, member 1 (ATP-binding cassette 

DE transporter 1) (ATP-binding cassette 1) (ABC-1) . 

GN ABCAl OR ABCl . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=DBA/2; TISSUE=Macrophage; 

RX MEDLINE=94375008; PubMed=:8088782 ; 

RA Luciani M.F., Denizot F., Savary S., Mattei M.-G., Chimini G, ; 

RT "Cloning of two novel ABC transporters mapping on human chromoaotne 

RT 9 . " ; 

RL Genomics 21:150-159(1994). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; 

RX MEDLINE=21251004; PubMed=11352567 ; 

RA Qiu Y., Cavelier L., Chiu S., Yang X., Rubin E., Cheng J.-F.; 

RT "Human and mouse ABCAl comparative sequencing and transgenesis 



RT 


studies revealing novel regulatory sequences . " ; 


RL 


Genotaics 


73 :66-76 (2001) . 




CC 


-1- FUNCTION: CAMP -DE PENDENT AND SULFONYLUREA-SENSITIVE ANION 


CC 


TRANSPORTER. 


KEY GATEKEEPER INFLX^NCING INTRACELLULAR CHOLESTEROL 


CC 


TRANSPORT (BY SIMILARITY) . 


CC 


-1- TISSUE SPECIFICITY: WIDELY EXPRESSED IN ADULT TISSUES. HIGHEST 


CC 


LEVELS ARE FOUND IN PREGNANT UTERUS AND UTERUS. 


CC 


-1- DOMAIN: MULTIFUNCTIONAL POLYPEPTIDE WITH TWO HOMOLOGOUS HALVES, 


CC 


EACH 


CONTAINING AN HYDROPHOBIC MEMBRANE -ANCHORING DOMAIN AND AN 


CC 


ATP BINDING 


CASSETTE 


(ABC) DOMAIN. 


CC 


- ! - PTM : 


Phosphorylation 


on Ser-2 054 regulates phospholipid efflux (By 


cc 


sitailarity) , 






cc 
cc 
cc 


-1- SIMILARITY: 


BELONGS TO THE ABC TRANSPORTER FAMILY. ABCA SUBFAMILY. 


This SWISS -PROT 


entry is 


copyright. It is produced through a collaboration 


cc 


between 


the Swiss Institute of Bioinf ormatics and the EMBL outstation - 


cc 


the European Bioinf ormatics Institute. There are no restrictions on its 


cc 


use by 


non-profit institutions as long as its content is in no way 


cc 


modified 


and this statement is not removed. Usage by and for commercial 


cc 


entities 


requires a license agreement (See http: //www.isb-sib.ch/annoimce/ 


cc 
cc 

DR 


or send an email to license@isb-sib . ch) . 


EMBL; X75926; CAA53530.1; 


ALT_INIT. 


DR 


EMBL; AF287263; 


AAG39073 . 


1; ALT_INIT. 


DR 


MGD; MGI: 


99607; 


Abcal . 




DR 


GO; GO: 0008203; 


P : cholesterol metabolism; IDA. 


DR 


GO; GO: 0030301; 


P : cholesterol transport; IDA. 


DR 


InterPro; 


IPR0a3593; AAA_ 


ATPase . 


DR 


InterPro; 


IPR003439; ABC_ 


transporter . 


DR 


Pfam; PF00005; ABC_tran; 


2 . 


DR 


ProDom; PD000006; ABC_transporter ; 2. 


DR 


SMART; SM00382 ; 


AAA; 2. 




DR 


PROSITE; 


PS00211; ABC_TRANSP0RTER_1 ; 1. 


DR 


PROSITE; 


PS508 93; ABC TRANSP0RTER_2 ; 2. 


KW 


ATP -binding; Glycoprotein; Transmembrane; Transport; Phosphorylation. 


FT 


TRANSMEM 


26 


42 


POTENTIAL . 


FT 


TRANSMEM 


640 


656 


POTENTIAL . 


FT 


TRANSMEM 


690 


706 


POTENTIAL . 


FT 


TRANSMEM 


717 


733 


POTENTIAL. 


FT 


TRANSMEM 


749 


765 


POTENTIAL. 


FT 


TRANSMEM 


771 


787 


POTENTIAL . 


FT 


TRANSMEM 


1041 


1057 


POTENTIAL . 


FT 


TRANSMEM 


1351 


1367 


POTENTIAL . 


FT 


TRANSMEM 


1661 


1677 


POTENTIAL . 


FT 


TRANSMEM 


1708 


1724 


POTENTIAL . 


FT 


TRANSMEM 


1737 


1753 


POTENTIAL . 


FT 


TRANSMEM 


1775 


1791 


POTENTIAL . 


FT 


TRANSMEM 


1854 


1870 


POTENTIAL . 


FT 


NP_BIND 


933 


940 


ATP (POTENTIAL) . 


FT 


NP_BIND 


1946 


1953 


ATP (POTENTIAL) . 


FT 


MOD_RES 


1042 


1042 


PHOSPHORYLATION (BY PKA) (MAJOR) (BY 


FT 








SIMILARITY) . 


FT 


MOD_RES 


2054 


2054 


PHOSPHORYLATION (BY PKA) (MAJOR) (BY 


FT 








SIMILARITY) . 


FT 


CARBOHYD 


14 


14 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


98 


98 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


151 


151 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


161 


161 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


196 


196 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


244 


244 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


2 92 


292 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


337 


337 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


349 


349 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


400 


400 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


478 


478 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


489 


489 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


521 


521 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


820 


820 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


1144 


1144 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


1294 


12 94 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


1453 


1453 


N-LINKED (GLCNAC. . .) (POTENTIAL). 



FT 


CARBOHYD 


14 99 1499 N- LINKED (GliCNAv. . . *) \ fUlBNl XAJj; . 




FT 


CARBOHYD 


1504 1504 N-LINKED (GLCNAC. . .) (POTENTIAL). 




FT 


CARBOHYD 


1637 1637 N-LINKED (GLCNAC. . .) (POTENTIAL). 




FT 


CARBOHYD 


2044 2044 N-LINKED (GLCNAC. . .) (POTENTIAL). 




FT 


CARBOHYD 


2238 2238 N-LINKED (GLCNAC. . .) (POTENTIAL). 




FT 


CONFLICT 


1567 1568 MISSING (IN REF . 2) . 




FT 


CONFLICT 


2024 2024 MISSING (IN REF. 2) . 




SQ 


SEQUENCE 


2261 AA; 254011 MW; FAE62B21FD1D09F9 CRC64; 




Query Match 


95.1%; Score 10906; DB 1; Length 2261; 




Best Local Similarity 94.8%; Pred. No. 0; 




Matches 2 087; Conservative 54; Mismatches 60; Indels 0; Gaps 


0. 


Qy 


1 


MPSAGTLPWVQGI ICNANNPCFRYPTPGEAPGWGNFNKS IVARLFSDARRLLLYSQKDT 


60 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 I 1 t 1 1 1 1 1 1 1 1 1 t 1 1 1 II 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M M 1 M 1 M 1 M 1 1 1 H M M h 1 M 1 M h M 




Db 


61 


MPSAGTLPWVQGI ICNANNPCFRYPTPGEAPGWGNFNKS I VSRLFSDAQRLLLYSQRDT 


12 0 


Qy 


61 


SMKDMRKVLRTLQQ I KKSSSNLKLQDFLVDNETFSGFLYHNLSLPKSTVDKMLRADVILH 


12 0 




1 III 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 : 1 1 1 1 1 1 1 hill H II II 1 II II II II 1 1 1 II 1 1 1 1 1 h 1 1 1 1 '\' '\ \ 




Db 


121 


S I KDMHKVLRMLRQ I KHPNSNLKLQDFLVDNETFSGFLQHNLSLPRSTVDSLLQXNVGLQ 


180 


Qy 


121 


KVFLQGYQLHLTSLCNGS KSEEM I QLGDQE VSELCGLPRE KLAAAE RVLRSNMDI LKP I L 


180 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 III 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 III 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 - 




Db 


181 


KVFLQGYQLHLASLCNGSKLEE I IQLGDAEVSALCGLPRKKLDAAERVLRYNMDILKPW 


240 


Qy 


181 


RTLNSTSPFPSKELAEATKTLLHSLGTLAQELFSMRSWSDMRQEVMFLTNVNSSSSSTQI 


240 




1 1 1 1 1 1 1 1 1 1 1 II III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 - 1 1 1 1 1 II III 1 II 1 1 1 1 • 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


TKLNSTSHLPTQHLAEATTVLLDSLGGLAQELFSTKSWSDMRQEVMFLTNVNSSSSSTQI 


300 


Qy 


241 


YQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGGNGTEEDAETFYDNSTTPYCNDLMK 


300 






1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 H 1 1 1 1 1 1 1 1 1 1 II 1 1 1 




Db 


301 


YQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGGNNTEEDVDTFYDNSTTPYCNDLMK 


360 


Qy 


301 


NLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 


360 




t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 

1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M M 1 1 1 II 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


NLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 


420 


Qy 


361 


PKIWTFMENSQEMDLVRMLLDSRDNDHFWEQQLDGLDWTAQDIVAFLAKHPEDVQSSNGS 


420 






■ 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 

h 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 • 1 M M 1 1 1 1 1 1 • 1 1 1 1 h 1 1 1 1 1 1 III 




Db 


421 


PQ I WTFMENS QEMDLVRTLLDSRGNDQ FWEQ KLDGLDWTAQD IMA FLAKNPEDVQS PNGS 


480 


Qy 


421 


VYTWREAFNETNQAIRTISRFMECVNLNKLEPIATEVWLINKSMELLDERKFWAGIVFTG 


480 




illilllllllllll Itllllklillllllll III llllllllllllllllllllll 

1 1 M 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 IJ 1 1 1 1 1 1 1 1 III M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


VYTWRE AFNETNQA I QT I SRFME CVNLNKLE P I PTE VRL INKSMELLDERKFWAG I VFTG 


540 


Qy 


481 


ITPGS lELPHHVKYKIRMDIDNVERTNKI KDGYWDPGPRADPFEDMRYVWGGFAYLQDW 


540 






III 1 : 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


ITPDSVELPHHVKYKIRMDI DNVERTNKI KDGYWDPGPRADPFEDMRYVWGGFAYLQDW 


600 


Qy 


541 


EQAI IRVLTGTEKKTGVYMQQMPYPCYVDDIFLRVMSRSMPLFMTLAWIYSVAVI IKGIV 


600 






II II II lllhllllllhlll II IIJIII III Illlllllllllllllll II II II II 




Db 


601 


EQA I IRVLTGSEKKTGVYVQQMP YPC YVDD I FLRVMSRSMPLFMTLAWI YSVAVI I KS I V 


660 


Qy 


601 


YEKEARLKETMRIMGLDNSILWFSWFISSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 


660 




Illlllllllllllllll llllllhlllllllllllllllllllllllMIIIIIIII 




Db 


661 


YEKEARLKETMRIMGLDNGILWFSWFVSSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 


720 


Qy 


661 


FLSVFAWTILQCFLISTLFSRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFTLKIFAS 


720 






lllll|:|||||||||||IMIIIIIIIIIIIIIIIIIIIMIIIIIIIMIh:|IMI 




Db 


721 


FLSVFAMVTILQCFL I STLFSRANLAAACGGI I YFTLYLPYVLCVA WQDYVGFS I KI FAS 


780 


Qy 


721 


LLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTW 


780 






1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 




Db 


781 


LLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTAVSMMLFDTFLYGVMTW 


840 


Qy 


781 


Y I EAVFPGQYGI PRP WYFPCTKS YWFGEESDEKSHPGSNQKRI SE I CMEEEPTHLKLGVS 


840 






IIIIMIIIIIIIIIIIIIIIIIIIIIII lllllllhll :|llllllllllhllll 




Db 


841 


Y I EAVFPGQYGI PRP WYFPCTKS YWFGEE IDE KSHPGSSQKGVSE I CMEEEPTHLRLGVS 


900 


Qy 


841 


I QNLVKVYRDGMKVA VDGLALNF YEGQ I TS FLGHNGAGKTTTMS I LTGLFP PTSGTA Y I L 


900 



lllllllllllllllllllllllllllllllllllllllllllllillllllllllllll 



Db 


901 


IQNLVK\maX3MKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTA^ 


960 


Qy 


901 


GKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVG 


960 




IIIIMIIhlllllllllllllllllllllllllllllMllllllllillllllllll 




Db 


961 


GKDIRSEMSSIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVG 


1020 


Qy 


961 


LPSSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQ 


1020 




II lllllllillllllllllllllllllllllllllMIIIIIIIIIIIIIIIIIIIII 




Db 


1021 


LPPSKLKSKTSQLSGOTQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQ 


1080 


Qy 


1021 


GRTI ILSTHHMDEADVLGDRIAI ISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 


1080 




llllllllllllllhlllillMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


1081 


GRTI ILSTHHMDEADILGDRI AI ISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 


1140 


Qy 


1081 


S CRNSS STVS YLKKEDS VSQSS SDAGLGSDHE SDTLT I DVS A I SNL I RKHVSE ARLVED I 


1140 




llllllllll lllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


1141 


SCRNSSSTVSCLKKEDSVSQSS SDAGLGSDHE SDTLT I DVSA I SNL I RKHVSE ARLVED I 


1200 


Qy 


1141 


GHELTYVLPYEAAKEGAFVELFHE IDDRLSDLGI SS YGI SETTLEE I FLKVAEESGVDAE 


1200 




IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIII 




Db 


1201 


GHELTYVLPYEAAKEGAFVELFHE IDDRLSDLGI SS YGI SETTLES I FLKVAEESGVDAE 


1260 


Qy 


1201 


TSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGSYQVK 


1260 




llllllllllllllllllllll lllllll IIIIIIIIIIIIIIIIMIMIIIIIhl 




Db 


1261 


TSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETDLLSGMDGKGSYQLK 


1320 


Qy 


1261 


GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ 


1320 




IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 




Db 


1321 


GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ 


1380 


Qy 


1321 


PWMYNEQ YTFVSNDA PEDTGTLELLNALTKDPGFGTRCMEGNP I PDTPCQAGEEE WTTA P 


1380 




IIMIIIIIIIIIIMII II lllllllllllllllllllllllllll lllhll :| 




Db 


1381 


PWMYNEQ YTFVSNDA PEDMGTQE LLNALTKDPGFGTRCMEGNP I PDTPCLAGEEDWTI S P 


1440 


Qy 


1381 


VPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADILQDL 


1440 




|||:|:||||l|lllhlllllllllllllllllllllllllllMIIIII lllllhl 




Db 


1441 


VPQS I VDLFQNGNWTMKNPSPACQCSSDKI KKMLPVCPPGAGGLPPPQRKQKTADILQNL 


1500 


Qy 


1441 


TGRNI SDYLVKTYVQ 1 1 AKSLKNKI WVNE FRYGGFSLGVSNTQALPPSQEVNDATKQMKK 


1500 




lllillllMIMiMlllllllllllllliillllllllhiillll Mill lllll 




Db 


1501 


TGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNSQALPPSHEVNDAIKQMKK 


1560 


Qy 


1501 


HLKLAKDSSADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHAISSFLNVINNAILRANLQK 


1560 




III ||:|||llhllllll lllhlllllllllMIIIIIIIIIMIIIIIIIlin 




Db 


1561 


LLKLTKDTSADRFLSSLGRFMAGLDTKNNVKVWFNNKGWHAISSFLNVINNAILRANLQK 


1620 


Qy 


1561 


GENPSHYGI TAFNHPLNLTKQQLSE VA PMTTS VDVLVS I CV I FAMS FVP AS F WFL I QER 


1620 




lllll lllllllllllllllllllll llllllllllllllllllllllllllllllll 




Db 


1621 


GENPSQYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMSFVPASFWFLIQER 


1680 


Qy 


1621 


VSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQKSYVSSTNLPVLAL 


1680 






IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIMIIIIMII 




Db 


1681 


VSKAKHLQFISGVKPVI YWLSNFVWDMCNYWPATLVI I IFICFQQKSYVSSTNLPVLAL 


1740 


Qy 


1681 


LLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATFVLELFTDNKLNNIN 


1740 




III III IMIIIMIIII II lllllllllllllllllllllllllll MM hlllhll 




Db 


1741 


LLLL YGWS I TPLMYPAS FVFKI PSTA YWLTS VNLF I G INGS VATFVLE LFTNNKLNDI N 


1800 


Qy 


1741 


DILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSWDLVGRNLFAMAVEG 


1800 






lllll III II MM II lllll MM II III lllll III II lllllll II II II II III II 




Db 


1801 


D I LKS VFL I FPHFCLGRGL I DMVKNQAMADALERFGENRFVS PLS WDLVGRNLFAMA VEG 


1860 


Qy 


1801 


WFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRILDGGGQNDILEIKELTKI 


1360 






IIIIIIIIIIIMIIIIIIII III llllllllllllllllllllllllllllllllll 




Db 


1861 


WFFL ITVL I QYRFF IRPRPVKAiCLPPLNDEDEDVRRERQRILDGGGQNDI LE I KELTKI 


1920 


Qy 


1861 


YRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLTGDTTVTRGDAFLNRNSILSN 


1920 




llllllllllllhlllllllMlllllllllhlllllllll IIIIIIMhIIIMI 




Db 


1921 


YRRKRKPAVDRICIGIPPGECFGLLGVNGAGKSTTFKMLTGDTPVTRGDAFLNKNSILSN 


1980 


Qy 


1921 


IHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGEWAIRKLGLVKYGEK 


1980 



Db 


1981 


Qy 


1981 


Db 


2041 


Qy 


2041 


Db 


2101 


Qy 


2101 


Db 


2161 


Qy 


2161 


Db 


2221 



lllllllllllllllllllllllllllllllllllllilllll llllllllllllllil 

IHEVHQNMGYCPQFDA ITELLTGREHVEFFALLRGVPEKEVGKFGEWA IRKLGLVKYGEK 2 04 0 

YAGNYSGGNKimjSTAMALIGGPPVVFLDEPTTCaiDPKARRFLWNCALSVVKEGRSVVLT 2 04 0 

II lllllllllllllllllllllllllllllllllllllllllllllhllllllllll 
YASNYSGGNKRIOjSTAMALIGGPPVVFLDEPTTGMDPKARRFLWNCALS IVKEGRSWLT 2100 

SHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIVVRIAGSNPDLKPVQDFFG 2100 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIilllllllllllhlll 

SHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIAGSNPDLKPVQEFFG 216 0 
liAFPGSVPKEKHRNMLQYQLPSSLSSIJVRIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 2160 

IIIIMI IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIII 

LAFPGSVLKEKHRNMLQYQLPSSLSSLARI FS ILSQSKKRLHIEDYSVSQTTLDQVFVNF 222 0 

AKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 22 01 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 
AKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 2261 



